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Individual  differences  among  learners  constitute  an  important  class  of 
variables  for  research  on  instruction.  Their  study  has  been  of  interest  at 
least  since  Binet , because  measures  of  these  variables,  often  called  "apti- 
tudes", usually  predict  learning  outcome.  There  is  renewed  interest  in  this 
fact  today  because  aptitudes  now  often  appear  to  interact  with  instructional 
conditions, !• e. , to  relate  differently  to  learning  outcome  under  different 
instructional  treatments.  Practical  interest  stems  from  the  possibility 
that  such  interactions  can  be  used  to  adapt  instruction  to  fit  different 
learners  optimally,  since  previous  attempts  at  individualizing  instruction 
have  generally  failed  to  eliminate  individual  differences  in  learning  outcome. 
Such  interactions  are  of  theoretical  interest  because  they  demonstrate  con- 
struct validity  for  aptitude  and  learning  measures  in  a new  way,  and  raise 
the  possibility  that  common  processes  underlie  both  kinds  of  variables. 

They  suggest  that  neither  aptitude  constructs  nor  learning  processes  can  be 
fully  understood  without  reference  to  the  other. 

Aptitude-treatment  interactions  (ATI)  have  been  the  subject  of  many 
studies  in  recent  years,  and  a wide  variety  of  ATI  findings  are  now  on  hand. 
From  their  review  of  this  research,  Cronbach  and  Snow  (in  presc)  concluded 
that  the  existence  of  ATI  as  phenomena  has  been  clearly  established.  But, 
while  some  ATI  findings  are  plausible  and  some  are  replicable,  few  are  well 
understood  and  none  are  yet  applicable  to  Instructional  practice. 

The  volume  of  ATI  studies  can  be  arrayed  along  a continuum  from  labora- 
tory experiments  on  individual  differences  in  learning,  through  small-scale 
and  middle-range  instructional  experiments,  to  large  curriculum  evaluations, 
naturalistic  comparisons,  and  empirical  case  studies.  If  one  constructed 
a frequency  distribution  of  such  studies  along  this  continuum,  it  would 
probably  appear  more  or  less  normal  in  form;  most  studies  would  fall  in 
the  middle  range.  And,  the  same  form  of  distributior  might  be  expected  for 
instructional  experiments  in  general,  not  just  for  ATI  studies.  Thus,  the 
instructional  psychology  popular  today  consists  of  short-term  experiments 
with  a few  controlled  instructional  variables  aimed  at  testing  fairly  simple 
propositions.  These  studies  seek  compromise  between  the  need  for  instruc- 
tional relevance  and  the  need  for  experimental  control.  But  most  attain 
neither  the  descriptive  value  of  large-scale,  long-term  naturalistic  research 
nor  the  precision  and  process  analysis  of  the  laboratory;  inconsistencies 
abound  in  their  results.  Cronbach  and  Snow  (in  press)  concluded  that  the 
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middle-range  studies  were  leading  neither  to  theory  nor  to  generalizations 
useful  in  practice.  They  recommended  that  future  research  pay  increased 
attention  to: 

1)  The  examination  of  the  most  plausible  ATI  hypotheses  in  large 
scale,  long-duration,  real-school  studies.  This  would  allow  a consolidation 
of  efforts  to  establish  a few  ATI  hypotheses  in  settings  where  they  might 
actually  be  used.  The  emphasis  in  the  design  of  such  research  would  be  on 
representativeness  (Snow,  1974)  and  description  (Cronbach,  1975),  rather 
than  on  laboratory-like  control. 

2)  The  development  of  methodology  capable  of  handling  the  complexities 
of  such  research.  This  effort  would  deemphasize  the  familiar  significance 
testing  habits  of  researchers  in  favor  of  the  description  and  analysis  of 
complex  relationships  (Cronbach  & Snow,  in  press;  Cronbach  & Webb,  1975; 
Cronbach,  1976) . 

3)  The  development  of  a laboratory  science  for  the  analysis  of  aptitude 
tests  and  learning  tasks,  and  the  ATI  constructs  based  on  them  (Snow,  1976). 
This  would  complement  the  instructional  studies  with  process  analyses  to  pro- 
vide ideas  about  possible  underlying  mechanisms.  Embodied  in  newly  under- 
stood and/or  newly  designed  aptitude  measures,  these  ideas  might  then  be 
conveyed  to  research  in  the  real  instructional  settings  where  probable, 
practically  useful  ATI  can  be  examined  and  used. 

The  present  review  must  be  highly  selective  with  respect  to  each  of 
these  three  lines  of  continuing  research.  It  will  avoid  repeating  material 
available  in  the  above  cited  sources  wherever  possible.  Since  the  previous 
review  in  this  series  by  Berliner  and  Cahen  (1973)  and  the  Cronbach-Snow  book, 
many  new  ATI  studies  have  appeared.  No  attempt  will  be  made  here  to  catalog 
this  heterogeneous  collection;  only  studies  bearing  on  a few  major  ATI  hypo- 
theses and  methodological  developments  are  reviewed.  Suggestions  for  further 
research  are  made  along  the  way. 

Definitional  Matters 

Before  proceeding,  it  may  be  helpful  here  to  deal  with  several  defini- 
tional concerns  that  have  confused  thinking  and  writing  in  the  ATI  field. 

Some  of  these  are  touched  upon  in  Cronbach  and  Snow  (in  press);  some  are 
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The  first  concern  is  with  the  term  "aptitude"  itself.  Cronbach  and 
Snow  (in  press)  defined  aptitude  as  "...any  characteristic  of  a person 
that  forecasts  his  probability  of  success  under  a given  treatment"  (p.  6). 

This  purposely  broad  and  pragmatic  definition  was  meant  to  promote  a new  kind 
of  instructional  research.  Personality  variables,  biographical  and  other  non- 
test measures,  and  new  kinds  of  aptitude  constructs  might  predict  response  to 
instruction  in  a given  setting,  singly  or  in  combination,  and  hence  might  be 
thought  of  as  sources  of  aptitude  for  success  in  that  setting.  The  tradi- 
tional conception  of  aptitude  for  school  learning,  as  represented  exclusively 
by  "scholastic  aptitude  tests"  or  "general  ability  tests"  was  unnecessarily 
limiting.  Other  work  in  differential  psychology  has  developed  more  specialized 
multivariate  conceptions  of  aptitude  for  success  in  specialized  activities,  and 
these  span  the  artificial  distinctions  between  achievement,  ability,  person- 
ality, etc.:  mechanical  knowledge  is  one  aptitude  for  technical  work,  along  with 

various  psychomotor  skills;  some  esthetic  sensitivities,  along  with  drawing 
and  perceptual  skills,  are  aptitudes  in  art;  a certain  degree  of  compulsivity 
may  be  an  additional  aptitude  for  computer  programming!  even  height  is  an 
aptitude  for  basketball.  There  is  no  a priori  reason  why  aptitude  for  success 
in  instructional  activities  should  be  less  complex,  diversified,  or  specialized 
than  it  is  in  these  other  pursuits. 

Other  writers,  hoping  to  avoid  potential  confusions,  have  preferred  terms 
like  "attribute-treatment  interaction"  or  "trait-treatment  interaction".  But 
these  are  hollow  terms — empty  of  substantive  meaning.  "Aptitude",  on  the  other 
hand,  is  a substantive  concept  in  educational  psychology.  We  should  expect 
continued  research  to  alter  and  elaborate  the  meaning  of  such  concepts,  just 
as  it  alters  and  elaborates  concepts  of  instructional  method.  Further,  in- 
dividual aptitude  might  be  expected  to  develop  or  change  with  continued  ex- 
perience in  a given  kind  of  instruction;  "trait"  and  "attribute"  imply 
permanence.  Most  importantly,  we  can  hope  there  will  someday  be  a theory 
of  "aptitude";  there  can  never  be  a theory  of  "trait"  or  of  "attribute".  The 
present  writer  thus  persists  in  advocating  use  of  the  substantive  term.  Re- 
search on  aptitude  for  learning  is  the  study  of  individual  differences  in 
learning  and  learning-related  processes,  particularly  as  these  vary  and  covary 
under  different  instructional  conditions. 

A second  issue  arose  over  the  distinction  between  intrinsic  and  extrin- 
sic individual  differences  relevant  to  learning.  The  former  take  their 
definitions  directly  from  measures  of  learning  processes;  the  latter  do  not. 
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The  extrinsic  category  thus  includes  most  aptitude  constructs  available  from 
differential  psychological  research  and  there  is  the  suggestion  that  these  are 
not  process-based  at  all.  The  distinction  was  introduced  by  Jensen  (1967)  simply 
to  classify  prior  studies,  and  was  later  discarded  by  him.  Although  some  re- 
searchers have  maintained  the  contrast  as  substantively  important,  it  is  of 
doubtful  merit.  First,  to  assert  that  aptitude  measures  are  not  concerned 
with  psychological  processes  is  absurd,  even  though  differential  psychology  has 
not  yet  been  much  concerned  with  the  development  of  process  theories.  When  an 
aptitude  measure  relates  differently  to  learning  measures  under  different 
learning  conditions,  there  is  the  implication  that  that  aptitude  is  fundamentally 
involved  in  learning  processes  under  one  or  both  of  those  conditions.  Whether 
the  processes  represented  by  the  aptitude  measure  are  related  to,  or  are  the 
same  as,  those  represented  by  the  learning  measure  to  which  it  relates  is  an 
empirical  and  theoretical  matter.  There  is  considerable  evidence,  for  example, 
that  general  mental  tests  represent  the  ability  to  learn  in  conventional  in- 
struction, and  some  further  evidence  that  personality  constructs  combine  with 
mental  ability  to  magnify  or  dampen  this  relation.  The  task  for  further  re- 
search is  to  build  a more  detailed  process  theory  of  such  relations.  It  helps 
such  research  not  at  all  to  prejudge  the  matter  with  arbitrary  classifications . 

Relevant  here  also  is  the  optimistic  view  that  process-oriented  research 
on  individual  differences  in  learning  will  provide  "new  aptitudes"  (Glaser,  1972), 
different  in  kind  from  the  "old  aptitudes".  This  is  an  important  possibility. 
Since  individual  differences  in  mental  performances  almost  always  correlate 
however  (Guttman> 1976) , it  is  more  likely  that  the  new  and  the  old  will  differ 
in  form  more  than  in  kind,  and  that  an  improved,  integrated  conception  of  human 
cognition  will  need  to  be  built  on  their  combination.  Both  kinds  of  constructs 
always  must  be  included  in  such  research  anyway,  since  new  constructs  cannot 
be  defined  without  demonstrating  discriminant  validity  with  respect  to  existing 
constructs  (Campbell  and  Fiske,  1959).  This  admonition  applies  as  much  to  the 
old  artificial  distinctions  in  differential  psychology  between  ability,  style, 
and  personality,  as  it  does  to  new  artificial  distinctions  between  what  is 
measured  by  tests  and  what  is  measured  by  laboratory  task  parameters.  We  can 
strive  for,  and  expect,  new  aptitude  constructs,  but  these  are  likely  to  be 
woven  in  large  part  from  the  threads  of  existing  cloth  . 

A third  matter  can  be  dispensed  with  quickly.  Early  discussions  of  ATI, 
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including  those  of  the  present  writer,  emphasized  only  disordinal  interactions 
(where  regression  lines  intersect  within  the  aptitude  range) . Instances  of 
ordinal  interactions  were  ignored,  or  were  classed  with  instances  of  no  inter- 
action. But  ordinal  and  disordinal  interactions  can  have  similar  practical 
implications,  depending  on  other  aspects  of  instruction,  e.g.,  costs.  More 
importantly,  both  have  the  same  theoretical  implications.  The  distinction 
is  thus  unimportant  for  the  purposes  of  future  research. 

Finally,  the  ATI  approach  has  been  defined  by  some  as  relevant  only  to 
one  narrow  form  of  individualized  instruction,  i.e.,  to  situations  where  students 
can  be  assigned  to  alternative  instructional  treatments.  It  is  now  clear, 
however,  that  all  attempts  at  individualizing  instruction  rest  explicitly  or 
implicitly  on  hypothesized  ATI.  Anytime  an  instructional  prescription  for 
one  student  differs  from  that  provided  another,  there  is  the  suggestion  that 
each  is  best  helped  toward  some  common  goal  by  following  his  or  her  own  pre- 
scription rather  than  someone  else's.  Further,  the  evaluation  of  instructional 
prescriptions,  whether  individualized  or  not,  requires  an  ATI  approach  even 
where  there  is  no  intent  ultimately  to  assign  students  to  alternative  in- 
structional treatments.  In  describing  any  kind  of  instructional  effect,  one 
must  always  be  able  to  say  whether  the  description  given  holds  for  each  student 
involved.  Research  on  aptitude  thus  takes  a place  in  more  general  efforts  to 
build  instructional  theory.  A theory  of  aptitude  is  required  in  the  second,  or 
"description  of  initial  state",  part  of  Glaser's  (1976)  four-part  conception  of 
prescriptive  theory  for  instruction,  and  the  methodology  of  ATI  research  fills 
out  the  fourth  part — "assessment  of  instructional  effects". 

Instructional  Studies 

The  present  review  of  instructional  studies  concentrates  on  two  hypotheses. 
One  of  these  asserts  that  individual  differences  in  anxiety  (A^) , achievement 
via  independence  (A^),  and  achievement  via  conformity  (Ac)  interact  with  in- 
structional treatments  differing  in  the  degree  of  structure  and  participation 
provided  the  student.  This  will  be  referred  to  simply  as  the  A^A^A^  complex. 

The  other  concerns  general  mental  ability  (G)  and  the  extent  to  which  its  relation 

to  learning  outcome  varies  with  the  information  processing  burden  placed  on 

the  learner  by  the  instructional  treatment.  Since  general  ability  tests  typically 
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combine  types  of  items  that  have  been  theoretically  distinguished  (by  Cattell, 
1971,  and  Horn,  1976)  as  representing  crystallized  ability  (G^),  fluid  ability 
(Gf)  and  spatial  visualization  ability  (Gv> , this  hypothesis  will  be  referred 
to  as  the  G^G^G^  complex.  The  two  aptitude  complexes  are  probably  not  ortho- 
gonal . 

The  AjAcAx  complex.  A large  number  of  ATI  studies  that  used  personality 
constructs  as  aptitudes  were  summarized  by  Cronbach  and  Snow  (in  press)  under 
the  general  rubric  of  "constructive"  vs.  "defensive"  motivation.  It  was  hy- 
pothesized that  constructively  motivated  students,  those  ready  to  take  confident, 
self-directing  initiative  in  learning,  would  profit  in  less  directive  instruc- 
tional situations  that  allow  and  encourage  student  initiative;  defensively 
motivated  students,  those  more  anxious,  dependent,  or  conforming,  would  require 
situations  tailored  to  provide  a more  supportive  external  structure,  with  less 
demand  for  independent  action,  within  which  such  students  could  work  effectively. 
This  was  admittedly  a gross  and  oversimplified  contrast,  but  several  strong 
studies  had  provided  its  core  support.  These  certainly  were  worth  more  gen- 
eralized attention. 

Several  of  the  core  studies  were  contributed  by  Domino.  In  one  (Domino, 
1968),  college  students,  matched  on  sex  and  nonverbal  ability,  were  classified 
as  High-High,  Low-Low,  Low-High,  and  High-Low  on  the  basis  of  their  scores  on 
the  A^  and  A scales  from  the  California  Psychological  Inventory.  Domino  then 
interviewed  instructors  of  every  course  taken  by  the  students,  to  classify  the 
courses  as  "encouraging  conformity"  or  "encouraging  independence".  Variables 
such  as  emphasis  on  objectives,  memorization,  attendance,  etc.,  vs.  independent 
reading,  student  discussion,  informal  evaluation,  etc.,  were  used  to  direct 
the  classification.  It  was  found  that  students  showing  a High  A-  Low  A^, 
profile  achieved  better  grades  in  courses  where  they  were  encouraged  to  be 
independent.  Those  with  a Low  A^ - High  A^  profile  obtained  higher  average 
grades  in  the  more  structured  courses.  High -High  students  did  better  than 
Low  - Low  students  in  general. 

Domino  (1971)  then  followed  with  a formal  experiment,  in  which  High  A^- 
Low  A and  Low  A^~  High  A(  extreme  groups  were  defined.  Here,  a single  in- 
structor taught  four  sections  of  Introductory  Psychology,  two  in  a structured 
style  emphasizing  conformity  and  two  in  a style  emphasizing  independence.  Ex- 
treme groups  were  divided  randomly  among  the  two  treatments,  and  several  achieve- 
ment outcome  measures  were  used.  Again,  students  high  in  A^  and  low  in  Ap  did 
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best  with  instruction  favoring  independence;  students  low  in  A.  and  high  in 
were  better  off  with  instruction  requiring  conformity. 

In  another  line  of  work,  Dowaliby  and  Schumer  (1973)  contrasted  teacher- 
centered  vs.  student-centered  teaching  in  a junior  college  psychology  course. 

In  the  former  treatment  the  teacher  carried  the  communication  burden,  with 
little  participation  by  students.  In  the  latter  treatment,  students  were  en- 
couraged to  ask  questions  and  contribute  ideas.  The  aptitude  of  interest  was 
the  Taylor  Manifest  Anxiety  Scale  (A^) . There  were  two  examinations  as  outcome 
measures.  ATI  was  marked;  the  more  anxious  students  were  better  treated  by 
teacher-dominated  instruction,  while  less  anxious  students  achieved  more  with 
participative  instruction. 

Domino  (1974)  also  replicated  the  Dowaliby-Schumer  results.  College  students 
in  an  English  course  were  assigned  to  teacher-centered  or  student-centered  in- 
struction. A^  again  interacted  with  achievement  outcome,  showing  that  outcome 
was  superior  for  high  A^  students  in  the  teacher-centered  approach  and  for  low 
A^  students  in  the  student-centered  approach. 

These  studies  along  with  related  research  reviewed  by  Cronbach  and  Snow 
(notably  a series  of  studies  by  McKeachie),  are  sufficient  to  establish  the 
importance  of  ATI  in  this  domain,  bvit  the  conclusion  sustained  by  their  combined 
results  is  not  really  satisfactory.  Several  questions  need  answers.  Are  treat- 
ment contrasts  characterized  as  structured  vs.  participative,  or  conforming  vs. 
independent,  or  teacher-centered  vs.  student-centered  similar?  In  what  ways? 

Is  this  a college-level  phenomenon  only,  or  are  similar  interactions  observable 

at  lower  levels  of  education?  Are  A, , A , and  A different  faces  of  the  same 

i c x 

general  motivational  construct?  How  are  any  or  all  of  these  aptitude  variables 
related  to  G,  which  has  been  found  in  other  research  to  combine  with  A^  in 
higher-order  interactions?  Two  more  recent  investigations  have  sought  to 
push  our  understanding  of  this  complex  of  questions  further. 

A study  by  Peterson  (1976)  obtained  striking  interactions  which  correspond 
roughly  to  Domino's  results  but  complicate  the  Dowaliby-Schumer  interpretation 
appreciably.  They  also  show  the  constructive-defensive  hypothesis  of  Cronbach- 
Snow  to  have  been  overly  simple.  Peterson  defined  four  treatment  conditions 
to  distinguish  between  the  Domino  emphasis  on  teacher  structure  and  the 
Dowaliby-Schumer  emphasis  on  student  participation.  One  teacher  taught  a two- 
week  unit  on  alienation  to  each  of  four  high  school  classes  in  Social  Studies 
(N»94) , assigned  at  random  to  a 2x2  design.  The  treatments  were  defined  as 
high  structure  and  high  participation  (HS:HP),  high  structure  and  low  parti- 
cipation (HS:LP),  low  structure  and  high  participation  (LS:HP),  and  low 
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structure  and  low  participation  (LS:I.P).  Peterson  trained  the  teacher  in 

these  four  styles  using  videotape  models.  A description  of  teacher  behavior 

in  the  four  treatments  was  built  into  the  models,  and  later  verified  by 

classroom  observations.  Thus  a fairly  detailed  account  of  teacher  behavior 

in  each  treatment  was  available.  She  also  included  four  aptitude  constructs: 

G (based  on  a Verbal  Comprehension  Test),  A^  (combining  the  Children's  Manifest 

Anxiety  Scale  and  the  Spielberger  Measure),  A plus  A.,  and  A minus  A,. 

c i c i 

The  latter  two  variables  were  defined  to  distinguish  general  motivation 
toward  achievement  from  special  orientation  toward  independence  vs.  conformity. 
(It  should  be  noted  here  that  aptitudes  entered  the  regression  equation  in  the 

order  G,  A , A plus  A.,  A minus  A.,  so  the  latter  are  partial  variates.) 

x c i c i 

Outcome  measures  were  immediate  and  delayed  multiple  choice  tests,  an  essay 

test,  and  several  attitude  scales. 

Multiple  regression  analysis  on  immediate  achievement  showed  a main 

effect  for  G,  no  main  effect  for  treatment,  and  two  substantial  ATI  effects. 

One  of  these,  for  A^  minus  A.,  replicated  the  Domino  findings,  as  shown  in 

Figure  1.  The  I,S:LP  treatment  was  best  for  students  high  in  A.  and  low  in  A . 

1 c 

Next  best  for  these  students  was  the  HS:HP  treatment.  For  students  with  the 
opposite  pattern,  i.e.,  high  A^  and  low  A.,  the  HS:LP  treatment  was  superior. 
Next  best,  at  least  for  those  in  this  group  with  high  general  motivation,  was 
LS:HP.  Peterson  concluded  that  conforming  students  needed  one  clear  strategy 
to  which  they  could  conform.  This  would  preferably  be  teacher  structure.  With- 
out that,  the  structure  provided  by  peer  participation  might  substitute,  at 
least  for  those  high  enougli  also  on  A..  Having  structure  provided  by  neither 
teacher  nor  students  was  particularly  inadequate  for  these  learners.  The  in- 
dependent students  thrived  in  this  latter  environment,  particularly  if  they 

were  quite  low  on  A . Presumably,  t+TPy’  preferred  and  were  able  to  provide 

c 

their  own  structure.  For  High  A.  students  also  showing  some  degree  of  A^ , 
however,  HS:HP  and  LS:HP  were  not  inadequate  treatments.  Results  for  the 
other  cognitive  outcome  measures  were  similar,  although  a shift  among  inde- 
pendent and  highly  motivated  students  was  noted  on  the  essay  test;  the  high 
participation  conditions  were  most  beneficial  here.  Also,  ATI  effects  faded 
somewhat  at  retention. 


Figure  1 here 
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Multiple  Choice  Achievement 


Achievement  via  Conformance  (Ail  mmut  Achievement  via  Independence  'Ail 


Low  Structure  ' High  Participation 


Regression  planes  for  four  treatments  using  plus  A1  and 

A minus  A as  aptitudes  and  multiple-choice  achievement  as 
C 1 

outcome  (after  Peterson,  1976). 


Figure  1 


The  other  ATI  involved  G and  A . G alone  gave  no  ATI,  nor  did  A alone. 

X X 

But  the  multiplicative  combination  of  G and  A yielded  the  result  shown  in 
Figure  2 (the  four  planes  are  separated  here  to  simplify  what  would  be  a 
highly  complex  figure).  The  operative  treatment  in  the  interaction  is  LS:LP. 
It  was  the  worst  treatment  by  far  for  students  low  in  both  aptitudes  or  high 
in  both.  It  was  the  best  treatment  by  far  for  students  with  the  opposite  ap- 
titude patterns,  i.e..  Low  G and  High  A or  High  G and  Low  A . The  other 

x x 

treatments  did  not  differ  appreciably;  HS:HP  and  HS:LP  would  apparently  be  the 
choice  for  students  throughout  a broad  diagonal  path  running  from  the  Low-Low 
to  the  High-High  aptitude  corners  of  the  cube.  In  the  middle  range,  LS:LP 
would  not  differ  from  these.  (While  the  figure  for  LS:LP  gives  the  impression 
of  a curved  surface  on  the  diagonal,  it  should  be  thought  of  as  a twisted 
sheet;  Peterson  did  not  fit  curvilinear  teims.)  Again  on  the  essay  test  a 
slight  shift  occurred,  principally  among  the  able  anxious  students,  for  whom 
some  structure  was  here  beneficial.  The  retention  measure  showed  similar  but 
less  distinct  ATI. 


Figure  ? here 

Apparently,  nonanxious  learners  who  lack  sufficient  ability,  and  able 
learners  who  are  also  highly  anxious  both  need  teacher  structure  to  do  well. 

One  might  hypothesize  that  the  less  able,  less  anxious  (underaroused?)  students 
need  to  be  shown  directly  and  motivated  to  do  the  required  things,  while  the 
more  able  but  anxious  (overaroused?  defensive?)  students  need  the  certainty 
that  th« y are  doing  the  required  things.  Without  external  direction  on  this, 
the  former  students  may  remain  lost  and  apathetic  while  the  latter  lose  them- 
selves through  frantic  shifts  of  attention  and  effort. 

The  results  are  consistent  with  Spielberger ' s state-trait  theory  of  anxiety 
(Gaudry  & Spielberger,  1971;  Spielberger,  1972).  Both  kinds  of  students  need 
to  reach  an  optimal  level  of  state-anxiety  for  effective  performance.  Teacher 
direction  brings  the  High  G-High  A students  down  to  this  level;  teacher  demand 
brings  the  Low  G-Low  A^  students  up  to  this  level.  Able,  nonanxious  students, 
on  the  other  hand,  are  benefitted  by  absence  of  formal  structure  and  partici- 
pation, perhaps  for  the  same  reasons  that  independent  students  are  benefitted 
by  this  treatment.  In  a moderately  difficult  task,  these  students  are  aroused 
enough  and  able  enough  to  organize  their  own  work  effectively.  It  is  less 


clear  why  LS:LP  serves  the  less  able,  anxious  students  well.  Following  the 
Speilberger  theory,  one  could  say  that  anxiety  present  In  these  students 
already  serves  an  optimal  activating  function  so  situational  demand  is  unnec- 
essary, even  debilitating.  Without  forced  structure  or  participation,  these 
students  can  work  inconspicuously.  In  short,  the  inclusion  of  ability  and 
the  distinction  of  structure  and  participation  seems  to  elaborate  the  Dowaliby- 
Schumer  finding  substantially,  and  to  bring  some  extant  theory  to  bear  on  ATI 
processes . 

With  attitude  as  outcome,  no  ATI  were  statistically  significant.  Their 

pattern,  however,  was  in  some  cases  opposite  that  obtained  for  the  cognitive 

outcomes.  HS:HP  yielded  the  most  favorable  attitudes  toward  the  content 

learned  among  students  high  on  A.  and  low  on  A^ , while  LS:LP  produced  the 

most  favorable  attitudes  among  students  low  on  A,  and  high  on  A . Also, 

i c 

students  high  on  G and  A^  said  they  liked  LS:LP  best,  while  students  high  on 

G and  low  on  A liked  LS:HP  best;  yet  in  both  cases  performance  was  lowest  in 
x 

these  treatments.  Such  reversals  seem  to  present  troubling  contradictions. 
Peterson  hypothesized  that  high  school  students  may  be  predisposed  to  say  they 
prefer  the  "other"  treatment  and  to  be  less  happy  under  conditions  where  they 
expend  more  energy,  even  if  they  are  learning  more  in  the  process.  This  re- 
minds us  that  attitude  may  often  function  as  an  intervening  variable,  al- 
though it  is  measured  at  posttest.  But  the  theory  noted  above  cannot  handle 
this.  It  is  hard  to  see  how  able  anxious  students,  for  example,  can  be  made 
to  feel  even  mo^  anxious  by  a treatment,  learn  less  in  the  process,  and  say 
they  like  it.  Attitude  as  outcome  is  not  well  conceptualized  or  measured  in 
many  instructional  studies  and  there  is  no  theory  to  go  on;  even  so,  such 
puzzles  need  to  be  checked. 

Another  distinctive  feature  of  the  Peterson  study  was  the  inclusion  of 
seven  other  classes  (N=166)  in  which  the  same  content  and  instruments  were  used, 
but  without  systematic  treatment  variation.  These  teachers  were  observed  in  an 
mpt  to  classify  their  natural  styles'  as  siTRiiar-to— one  or  -anothex  of  the 
experimental  treatments.  It  was  hoped  that  these  comparison  classes  would 
serve  to  test  the  representativeness  of  the  experimental  classes  and  to 
crossval idate  their  regression  equations.  The  results  were  somewhat  mixed. 

Some  comparison  classes  gave  results  consistent  with  predictions,  some  did  not. 


A multidimensional  scaling  of  distance  measures  among  the  11  class  regression 
equations  suggested  that  HS:HP  and  LS:HP  were  more  similar  to  other  natural 
classes  in  the  school  than  were  HS:LP  and  LS:LP.  In  addition,  it  was  clear 
that  other  treatment  dimensions  were  operating  in  the  comparison  classes  to 
influence  similarity  of  regression  equations  obtained  in  each. 

Porteus  (1976)  was  able  to  study  some  of  the  same  variables  in  a year- 
long investigation  covering  two  subject  matters,  Economics  and  Educational 
Philosophy.  Students  (N=56)  in  a private  high  school  were  assigned  to  one  of 
two  sections  of  each  course,  all  taught  by  the  same  teacher.  In  one  section, 
a teacher-centered  treatment  was  used,  with  required  attendance,  daily  reading 
assignments  and  homework,  frequent  quizzes,  and  term  projects  all  chosen  by 
the  teacher.  In-class  work  was  teacher-directed,  with  previews  and  summaries. 
The  other  section  was  student-centered;  students  initiated  class  discussion, 
requested  quizzes  and  homework  as  they  wished,  chose  projects,  and  reading  was 
assigned  in  2-4  week  blocks.  Outcomes  were  three  achievement  tests  spaced 
across  each  two-term  course,  an  essay  measure  added  to  the  third  of  these,  and 
student  attitude  and  course  perception  questionnaires  after  each  test.  Class- 
room audiotapes,  a teacher  log,  and  student  perceptions  helped  describe  in- 
struction over  the  year. 

The  aptitude  battery  Initially  included  the  same  A and  A measures  used 

i c 

previously  plus  the  Flexibility  Scale  from  the  same  inventory,  A^  (Spielberger 's 

scale),  several  measures  of  G,  or  G (Necessary  Arithmetic  Operations,  Hidden 

f c 

Figures,  SAT-V,  and  SAT-Q) , a questionnaire  designed  by  Myers  (1964)  as  a sub- 
stitute for  oroiective  measures  of  achievement  motivation,  and  a paragraph 
completion  test  used  by  D.  Hunt  (1975)  to  represent  a cognitive  style  construct 
called  "conceptual  level".  The  correlation  pattern  and  small  sample-  size, 
however,  suggested  that  these  variables  should  be  reduced  to  a smaller  number 
of  factors  if  possible.  Porteus  chose  four  orthogonal  aptitude  factors  for 

the  ATI  analysis.  These  were!  G (since  a distinction  between  G,  and  G could 

f c 

not  be_obtained);  A^  (a  factor  combining  A^  and  Flexibility);  Ac  (a  factor  com- 
bining A^  , negative  Flexibility,  and  the  achievement  motivation  questionnaire); 
and  A^.  The  expansion  of  the  A^  and  constructs  by  the  additional  components 
does  not  seem  to  change  their  meaning  appreciably,  so  labels  consistent  with 
prior  work  have  been  used  here.  (Porteus  used  somewhat  different  labels.) 

Also,  the  conceptual  level  measure  was  split  by  the  factor  analysis,  with 
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positive  loading  on  G,  negative  on  A^.  This  makes  sense;  the  scoring  of 
paragraph  completion  favors  conceptually  complex  and  unrestricted  verbal 
production,  as  opposed  to  simple,  concrete,  restricted  production.  Ability 
should  enhance,  and  anxiety  inhibit,  such  production.  This  hint  that  one 
style  construct  may  represent  a combination  of  ability  and  personality  con- 
structs, contrasting  High  G-Low  A^  with  Low  G-High  A , should  spur  further 
research  in  this  direction. 

In  Economics,  multiple  regression  analysis  showed  main  effects  for  C 

and  A on  all  but  one  outcome.  The  essay  test  gave  main  effects  associated 
c 

with  all  four  aptitudes,  with  that  for  Aj  the  largest.  There  were  no  treat- 
ment main  effects,  but  several  ATI  were  pronounced.  In  Education,  no  effect - 

of  any  kind  were  nominally  significant,  although  G gave  relatively  large  main 

2 

effects  and  ATI  appeared  noteworthy.  R for  the  full  regression  model  in  F.d- 

2 

ucation  accounted  for  only  36%  to  51%  of  the  variance;  in  Economics  R ranged 

from  65%  to  88%.  This  suggests  a difference  due  to  subject  matter. 

For  AJ  and  A in  Economics,  ATI  was  substantial  only  on  Test  2;  High  A.- 
i c i 

Low  A students  did  best  in  student-centered  instruction,  while  I.ow  A -High  A 
c i c 

students  were  better  off  in  the  teacher-centered  condition.  This  seems  consis- 
tent with  Domino's  findings  and  with  Peterson's,  if  one  looks  only  at  her  low 
participation  groups.  Test  2 was  given  at  the  end  of  the  first  term,  so  t !u* 
timing  also  corresponds  to  Domino.  But  results  for  the  other  outcomes  did  not 
match  earlier  findings,  and  it  was  the  regression  plane  for  the  student -cent ered 
group  that  seemed  to  shift.  On  Test  1,  this  treatment  was  poorest  for  Low  A 

c 

students,  with  no  differences  for  other  students.  On  Test  3 and  the  essay, 

teacher-centered  instruction  was  best  for  Low  A students,  and  High  A students 

c c 

did  best  with  student-centered  instruction.  A^  had  only  slight  moderating 
effects  on  these  trends.  In  Education,  all  the  results  were  faint.  Test  1 data 
seemed  to  match  those  for  Economics  Test  2,  again  replicating  Domino  and  Peter- 
son. But  Education  Test  2 showed  results  like  those  of  the  third  stage  measures 
in  Economics,  and  ATI  were  absent  at  the  third  stage  in  Education. 

Considering  ATI  for  G and  A^,  Porteus's  results  were  remarkably  similar 
to  Peterson's,  in  both  Economics  and  Education,  if  one  is  willing  to  equate  the 
student-centered  and  LS:LP  conditions  on  Test  1 and  then  make  some  allowance 
for  student  and  teacher  adaptations  across  Porteus's  very  long  time  scale. 


re.'**''  n.  « 
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Figures  3 and  4 give  these  results,  for  Economics  and  Education  respectively. 
In  each  figure,  panels  a,  b,  c,  and  d correspond  to  Tests  1,  2,  3,  and  the 
essay,  respectively.  The  most  striking  ATI  appeared  on  Economics  Test  1. 

The  student-centered  treatment  assumed  the  same  twisted  sheet  pattern  seen 
earlier  in  Figure  2 for  LS:LP,  while  the  teacher-centered  treatment  yielded 
a sloped  regression  plane  quite  similar  to  the  planes  for  Peterson's  other 
three  treatments  in  Figure  2,  As  before.  Low  G-Low  A^  and  High  G-High  A^ 
students  did  h fnT~~w i t h teacher-centering;  Low  G-high  A^  and  High  G-Low  A^ 
students  profited  more  under  student-centering.  Porteus  also  did  not  test 
for  curvilinearity. 


Figures  3abcd  and  4abcd  here 

On  later  Economics  outcomes,  the  shapes  of  the  two  regression  planes 
were  retained,  although  the  twisted  appearance  of  the  student-centered  treat- 
ment faded  somewhat  over  time.  Some  changes  in  elevation  of  the  planes  over 
time  also  would  change  the  resulting  decision  rules  for  assigning  students  to 
treatments;  this  is  unimportant  here  since  no  such  conclusions  would  be 
drawn  from  this  study  alone  anyway,  given  the  extremely  small  sample  size. 

The  similar  regression  patterns  nonetheless  give  encouragement  to  the  general 
hypothesis  when  they  match  those  of  other  studies. 

In  Education,  there  was  no  ATI  at  Test  1 (Figure  4a).  After  that,  the 
same  pattern  as  seen  in  the  Economics  data  emerged  across  time,  becoming  most 
pronounced  on  the  essay  test.  Here  again,  then.  High  G-Low  A^  and  Low  G-High  A^ 
students  did  best  with  student-centering;  students  low  on  both  aptitudes  or 
high  on  both  did  best  with  teacher-centering. 

Analyses  of  attitude  toward  the  instruction  yielded  only  one  noteworthy 
ATI.  Low  G-High  A^  students  in  Economics  felt  better  about  the  instructional 
method  in  the  student-centered  section  than  their  counterparts  did  in  the 
teacher-centered  section.  A few  contradictory  trends  like  those  Peterson 
found  were  seen,  but  there  were  also  instances  in  which  attitude  and  achieve- 
ment outcomes  showed  similar  patterns. 

The  interpretation  applied  to  Peterson's  results  using  Spielberger ' s 
theory  can  be  applied  here  also  to  account  for  the  general  ATI  pattern.  But 
Porteus 's  detailed  description  must  be  relied  upon  beyond  this.  Evidence  from 
classroom  audiotapes,  teacher  logs,  and  student  questionnaires  suggested  several  * 

factors  operating  between  courses,  and  within  courses  over  the  year,  that  might 
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Figure  3.  Regression  planes  for  two  treatments  using  G and  A as  aptitudes 
and  Economics  Test  1 (a).  Test  2 (b).  Test  3 (c),  and  Essay  (d) 
as  outcomes  (after  Porteus,  1976). 
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help  account  for  the  trends  observed.  First,  Economics  was  the  more  difficult 
subject  matter,  and  the  more  highly  structured  sequentially,  relative  to 
Educational  Philosophy.  These  differences  may  be  inherent  in  the  subject 
matter.  It  seems  fair  to  say  that  Economics  presents  more  of  a "serialist" 
learning  task,  while  Educational  Philosophy  is  more  "holist"  in  organization, 
to  use  a cognitive  style  distinction  suggested  by  Pask  and  Scott  (1972)  . 

Second,  perhaps  as  a result  of  this,  the  teacher  felt  that  daily  preparation 
and  class  attendance  was  essential  in  Economics;  in  Education,  students  could 
muddle  through  without  as  much  daily  concern.  The  absence  of  structured  text 
material  in  Education,  however,  increased  the  danger  here  that  students  would 
not  find  out  what  they  did  not  know  until  much  later.  In  the  student-centered 
section,  he  reported  that  students  were  not  well  prepared  in  either  course  be- 
fore Test  1,  particularly  in  Education.  Third,  since  the  teacher  felt  obligated 
not  to  stand  by  and  allow  students  to  fail,  he  compromised  the  treatments  over 
time.  After  Test  1,  he  increased  his  initiation  of  class  discussion  in  the 
student-centered  sections.  His  urging  produced  more  class  attendance  in  both 
courses.  After  Test  2 in  Economics,  he  gave  extra  help  in  evening  sessions  to 
students  needing  it.  This  was  not  done  in  Education,  since  day-to-day  con- 
fusions were  not  judged  problematic.  But  he  did  institute  some  optional 
variations  on  a term  project  in  Education  between  Tests  2 and  3.  Finally,  in 
Economics,  student  perceptions  characterized  the  treatments  as  different  through- 
out the  year.  In  Education,  the  treatments  were  perceived  as  different  earlier 
but  not  later  in  the  year.  The  teacher  concurred,  feeling  that  treatment 
differences  were  less  profound  in  both  courses  between  Tests  2 and  3.  In  either 
case,  the  perceived  differences  centered  on  procedural  and  out-of-class  di- 
mensions; in-class  differences  were  perceived  as  relatively  slight  throughout 
the  year. 

In  sum,  it  seems  quite  possible  that  course  differences  in  apti tude-treat- 
ment  adaptation  over  the  year  account  for  at  least  some  of  the  ATI  variations. 

With  the  more  structured  and  difficult  Economics  subject-matter,  G and  A^  played 
a more  major  role,  particularly  on  early  tests,  in  student-centered  instruction. 
This  was  the  high  stress  period  in  Economics.  Over  time,  the  compromised  treat- 
ment contrast  could  have  lessened  the  effect  while  leaving  ATI  roughly  similar 
in  form.  With  Education  subject-matter,  difficulties  crept  up  on  students  as  the 
year  progressed,  apparently  without  their  perceiving  the  treatment  contrast  being 
maintained.  Here,  then,  much  of  the  learning,  and  the  feeling  of  stress,  probably 


occurred  late  in  the  course.  It  Is  not  surprising  then  that  ATI  for  G and  A 

x 

showed  most  strongly  on  the  final  essay.  (See  Cronbach  and  Snow  for  other  studies 
showing  Ax  effects  varying  with  stress.)  This  may  account,  too,  for  the  confir- 
mation of  Domino's  A A hypothesis  only  on  Education  Test  1;  the  High  A student 
begins  independent  learning  without  prompting.  This  is  inconsistent  with  the  fact 
that  in  Economics  Domino's  effect  appeared  only  on  Test  2,  unless  one  says  that  it 
takes  longer  for  High  A.  to  become  effective  in  this  difficult  course.  The  teacher 

did  report  that  his  Economics  section  finally  became  "truly"  student-centered  as 
report  that  his  Economics  section  finally  became  "truly"  student-centered  as 

Test  2 approached.  And  evening  tutoring  (for  Low  A^  students?)  was  instituted 

after  that.  Admittedly,  this  is  all  impressionistic  description,  but  it  is 

credible,  and  it  clues  the  next  investigator  about  variations  to  expect. 

Both  the  Peterson  and  Porteus  studies  were  forced  to  use  small  samples 

of  students,  the  Porteus  study  extremely  so  within  subject-matters,  so  their 

statistical  power  was  weak  and  the  regression  surfaces  may  be  poorly  estimated. 

But  both  were  strong  in  the  sense  of  ecological  sampling.  Peterson  contrasted 

four  sharply  defined  treatments  within  a school  context  where  seven  other 

naturally  varying  classes  could  be  studied  comparatively.  Porteus  was  able  to 

trace  the  course  of  apt i tude- t rea tmen t adaptation  across  a year's  instruction 

in  two  subject-matters.  Most  importantly,  both  yielded  ATI  consistent  enough 

with  previous  research,  and  with  each  other,  to  verify  that  the  A, A A complex 

i c x 

is  an  important  source  for  ATI  research. 

Yet,  understanding  of  this  phenomenon  is  still  not  sufficient  for  theory 
or  practice.  Despite  obtaining  fairly  detailed  descriptions  of  treatments, 
the  two  studies  cannot  really  map  their  treatments  onto  one  another.  What  is 
needed  is  an  observation  system  designed  specifically  to  study  this  ATI  com- 
plex both  within  and  between  school  situations.  For  the  present,  other  im- 
pediments to  integration  remain.  The  curvilinear  functioning  of  A^  mentioned 
above  needs  more  detailed  checking,  the  two  studies  do  not  define  A^  and  A^ 
identically,  and  neither  study  gives  us  a clear  picture  of  how  these  three 
constructs  fit  together.  Peterson  and  Porteus  are  working  now  to  coordinate 
their  analyses  and  results  on  these  points.  Some  further  information  can  be 
given  here  on  the  last  matter. 

In  both  studies,  there  was  evidence  of  positive  correlation  between  G and 

A.,  and  negative  correlation  between  A and  A . Table  1 shows  these  and  other 
i c x 

results.  (Since  Peterson  used  simple  variables  rather  than  factors,  the 
table  gives  simple  results  rather  than  factor  combinations  for  Porteus. 
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Also  Peterson  used  a verbal  ability  test  alone,  so  Porteus's  SAT-V  results 

are  shown  rather  than  those  based  on  her  general  factor.  Both  are  labeled  G . ) 

c 

Note  that  the  correlation  pattern  was  similar  across  studies  except  for  A , 

c 

which  seemed  to  relate  to  G and  A,  in  the  Peterson  data  but  not  in  the  Porteus 

c i 

data.  These  relations  suggest  that  attempts  at  tracing  out  a more  detailed 

and  coordinated  network  among  these  constructs  might  pivot  on  G.  It  appears 

to  relate  to  A^  and  A^  more  highly  than  these  measures  do  to  one  another. 

A seems  to  run  counter,  particularly  to  A . And,  there  is  much  evidence, 
x c 

in  this  chapter  and  elsewhere,  showing  interaction  of  G and  Ax.  Further  re- 
search should  capitalize  on  such  relations,  seeking  convergent  and  discriminant 
process  descriptions  of  these  dimensions,  perhaps  using  observed  classroom  be- 
havior and  study  skills.  The  reliability  estimates  given  in  the  main  diagonals 
of  Table  1 add  the  suggestion  that  A.  and  A^  are  not  well  measured  by  the  CPI 
scales  alone.  Another  useful  project  for  further  research  would  be  to  build 
alternative  measures  for  these  constructs. 


Table  1 about  here 

Some  hypotheses  to  guide  process  analysis  come  from  the  bottom  half  of  Table 

1.  Selected  correlations  are  reported  from  a study  by  Kutkowski  and  Domino 

(1975)  relating  the  CPI  scales  to  an  inventory  of  study  skills  and  habits  among 

college  students.  (Here,  the  correlations  for  A are  approximated,  since  CPI 

contains  no  A scale.  The  approximation  was  based  on  correlations  from 
x 

other  CPI  scales  that  form  a factor  commonly  thought  to  represent  the  opposite 

of  A .)  A relates  rather  strongly  to  all  four  of  these  dimensions  of  study 
x c 

behavior.  A,  seems  associated  with  ail  but  delay  avoidance.  G and  A are 
1 c x 

apparently  concerned  only  with  work  methods.  Unraveling  such  relationships 
would  be  a useful  next  step. 

The  G G,C  complex.  All  of  the  above  argues  for  the  addition  of  G to  the 
c t v 

A^AcA  string.  But  G is  itself  a complex.  The  hierarchical  model  of  ability 
organization,  as  interpreted  by  Cattell  (1971),  has  at  its  highest  level  a 
distinction  between  fluid-analytic  intelligence  (G^)  and  crystallized-verbal 
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Table  1 


Correlations  among  Aptitudes  Reported  by 
Peterson  (1976),  Porteus  (1976),  and 
Rutkowski  and  Domino  (1975). 


Peterson  Experimental  Students  (N=94)  Porteus  Experimental  Students  (N=56) 

Aptitudes  G A A A G A A A 

r c lex c i c x 


G .94a  .47  .34 

c 

-.10 

.63 

.33 

.11 

-.10 

At  .48  .29 

-.16 

.62 

.07 

-.04 

A .65 

c 

-.45 

.70 

-.23 

A 

X 

.84 

.81 

Rutkowski-Domino  College 

Frosh  (N=201) 

Delay  Avoidance  - SSHA*5 

-.02 

.07 

.48 

.01' 

Work  Methods  - SSHA 

.32 

.30 

.41 

-.31 

Teacher  Approval  - SSHA 

.17 

.42 

.41 

-.12 

Education  Acceptance  - SSHA 

.10 

.26 

.49 

-.12 

Notes : 

a Coefficients  in  the  main  diagonals  are  reliability  estimates  given 
by  the  respective  authors.  Porteus 's  coefficient  for  G is  the 
coramunality  from  her  factor  analysis,  and  is  thus  an  unSerestimate . 

b The  four  scales  of  the  Survey  of  Study  Habits  and  Attitudes. 

c Estimated  as  the  reflected  average  correlations  between  SSHA  scales 
and  the  five  CPI  scales  with  highest  loading  on  Factor  II,  "Poise 
and  Self-assurance . " 
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intelligence  (G  ).  After  decades  of  factor  analytic  debate,  this  or  related 
hierarchica 1 views  have  finally  become  popular  because  they  fit  existing  data 
rather  well  while  offering  some  hope  of  parsimony.  More  specialized  dis- 
tinctions among  abilities,  such  as  Guilford's  (1967),  can  be  fit  into  the 
more  specific  levels  of  the  hierarchy.  Horn  (1976)  summarized  much  recent 
correlational  research  under  these  rubrics,  adding  spatial  visualization 
ability  (G^)  to  the  general  level.  The  provisional  use  here  of  this  three- 
sided  concept  of  G seems  justified. 

Cronbach  and  Snow  (in  press)  adopted  a hierarchical  view  in  attempting 
to  make  sense  of  the  large  number  of  ATI  studies  that  rest  on  measures  of 
one  or  another  of  these  constructs.  Unfortunately  no  large-scale  ATI  studies 
attack  this  complex  squarely.  In  most  past  research,  ATI  must  be  attributed 
to  an  undifferentiated  G.  And  studies  based  on  more  specialized  ability  con- 
structs cannot  be  understood  as  such,  in  the  hierarchical  view,  unless  they 
rule  out  interpretation  in  terms  of  G by  including  general  measures.  Few  do. 
Titus . the  present  section  must  rely  mainly  on  a scattering  of  small-scale 
instructional  experiments  that  are  impossible  to  integrate  in  a way  comparable 

to  the  studies  on  the  A. A A complex. 

i c x 

G has  been  the  most  widely  studied  aptitude  construct  in  instructional 
research  and  thus  has  produced  the  most  ATI.  Table  2 attempts  a summary  of 
the  kinds  of  treatments  that  seemed  to  alter  the  relation  of  G to  outcome 
most  substantially,  based  on  the  Cronbach-Snow  review.  The  general  hypothesis 
derived  from  this  review  is  stated  by  I terns  1)  and  11)  of  the  table;  in- 
structional treatments  differ  in  the  information  processing  burdens  they  place 
on,  or  remove  from,  the  responsibility  of  the  learner,  and  the  regression  slopes 
of  cognitive  outcomes  on  G become  steeper  or  shallower  accordingly.  It  seems 
that  the  more  learners  are  required  to  puzzle  things  out  for  themselves,  to 
organize  their  own  study  and  build  their  own  comprehension,  the  more  able 
learners  can  capitalize  on  their  strengths  profitably.  As  instructional  treat- 
ments are  arranged  to  relieve  learners  from  difficult  reading,  analyzing  complex 
concepts,  and  building  their  own  cognitive  structures,  the  more  such  treatments 
seem  to  compensate  for  or  circumvent,  less  able  learners'  weaknesses.  These 
latter  treatments  help  Low  G students;  the  High  G students  may  or  may  not  do 
well  in  them.  Often,  the  impression  from  such  studies  is  that  High  G students 
can  do  well  enough  no  matter  what  treatment  is  applied. 
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Table  2 here 


But  this  is  not  always  the  case.  Item  16)  in  Table  2 suggests  that  some 

kinds  of  treatment-supplied  cognitive  models  can  actually  be  harmful  to  High  G 

students.  Whether  such  an  effect  is  widespread,  and  whether  it  holds  for  G , 

c 

Gj,  and  Gv«  or  for  only  one  or  two  of  these  constructs,  are  important  research 
questions.  G^  and  Gv  are  easily  separated  correlationally.  (This  is  essen- 
tially the  distinction  between  v:ed  and  k:m  consistently  reported  in  British 
factor  analytic  studies.)  But  G^  is  not  readily  separable  from  the  other  two 
in  this  way.  Experimental  manipulations  that  could  make  such  distinctions 
would  sharpen  construct  validity,  especially  of  G^,  as  well  as  providing  pro- 
cess hypotheses  about  ATI. 

Two  lines  of  research  relating  to  this  question,  by  Salomon  and  by  Greeno 
and  Mayer,  were  summarized  by  Cronbach  and  Snow.  They  are  treated  here  briefly, 
followed  by  the  most  recent  studies  by  Mayer  and  Greeno,  along  with  scattered 
other  work. 

An  early  study  by  Salomon  (1968)  used  a commercial  film  to  give  either  "cue 

attendance"  or  "hypothesis  generation"  training  to  student  teachers.  The  former 

asked  the  students  to  notice  many  stimulus  details  in  the  film;  the  latter  asked 

them  to  produce  possible  themes  for  the  story  depicted.  The  film  scenes  were 

shown  in  scrambled  order  to  some  groups.  A posttest  was  scored  for  information 

seeking  (questions  asked  in  a problem-solving  transfer  task  about  a curriculum 

organization  problem).  G£  (GRE-V)  related  to  outcome  positively  after  hypothesis 

generation  training,  but  negatively  after  cue  attendance  training.  Apparently, 

Low  G students  were  helped  by  practice  in  attending  to  details,  but  High  G 
c c 

students  were  hurt  (or  bored)  by  this.  Hypothesis  generation  may  have  required 
more  synthesizing,  which  perhaps  overloaded  Low  G^  students  while  challenging 
High  Gc  students. 

Salomon  (1974)  then  investigated  the  effects  of  modeling  a skill  process 
by  film.  Two  studies  concerned  the  cue-attendance  skill,  defined  in  terms  of 
both  quantity  of  details  noticed  in  visual  displays  and  organization  of  spatial 
scanning  (perhaps  an  aspect  of  G^  and  G^) . The  visual  displays  were  complex 
paintings.  A cue  attendance  measure  served  as  both  pre  and  posttest,  yielding 
quant < ty  and  organization  scores.  Another  posttest  measured  information  search 
in  problem-solving.  In  one  treatment  condition,  a film  demonstrated  the  process 
of  zooming  in  on  closeups  of  specific  details  one  after  another  in  sequence. 


Table  2 


A Summary  of  ATI  Hypotheses  Relating 
General  Ability  (G)  to  Various  Instructional  Treatments  (T) 


When  T is  described  as... 


...  the  expected  results  for  students 
described  on  G as  Low  and  High  are... 


1)  placing  burdens  of  information  processing  on  learners, 

2)  using  elaborate  or  unusual  explanations, 

3)  a "new"  curriculum, 

4)  including  discovery  or  inquiry  methods, 

5)  encouraging  learner  self-direction, 

6)  relatively  unstructured  or  permissive, 

7)  relying  heavily  on  verbiage, 

8)  rapidly  paced, 

b 

9)  giving  minimal  essentials  by  PI  , for  learners  to  elaborate 

10)  giving  advance  organizers  on  difficult  material, 

11)  relieving  learners  of  information  processing  burdens, 

12)  giving  all  essentials  by  PI, 

13)  simplifying  or  breaking  down  the  task, 

14)  providing  redundant  text, 

15)  substituting  other  media  for  verbiage, 

16)  using  simplified  demonstrations,  models,  or  simulations, 

17)  varying  the  format  of  PI, 

18)  including  inserted  questions, 

19)  using  diagranmatic  or  pictorial  presentation, 

20)  based  on  specialized  film  or  TV. 


and  + respectively 


- 

II 

+ 

II 

- 

II 

+ 

II 

- 

II 

+ 

II 

- 

II 

+ 

II 

- 

II 

+ 

• I 

- 

II 

+ 

• 1 

- 

II 

+ 

II 

- 

II 

+ 

II 

- 

II 

+ 

II 

+ 

II 

0 

It 

+ 

II 

0 

II 

+ 

• I 

0 

II 

+ 

II 

0 

II 

+ 

II 

0 

II 

+ 

II 

_ 

II 

inconsistent 

inconsistent 

inconsistent 

inconsistent 


aFor  read  "poor";  for  "+"  read  "good"}  for  "0"  read  "uncertain", 
kpi  = Programmed  Instruction. 
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Another  treatment  merely  gave  practice  (called  "activation")  in  noticing  and 
listing  details.  ATI  were  marked;  the  film  model  helped  students  initially 
low  in  cue  attendance  but  disrupted  already  skilled  students,  who  did  better 
with  simple  activation.  The  same  result  occurred  for  the  problem-solving 
outcome  in  the  first  study  but  not  in  the  second  (although  it  was  measured 
differently  here).  Also,  in  the  second  study,  measures  of  G^  (Embedded  Figures) 
and  Gc  were  included,  and  these  gave  the  same  ATI  results  as  did  the  cue  atten- 
dance pretest.  It  was  hypothesized  that  giving  High  G students  a processing 
model  which  they  do  not  need  causes  interference  with  their  own  idiosyncratic 
processing.  Also  noted  was  the  finding  that  scanning  by  able  students  appeared 
more  organized  after  model  training;  they  seemed  to  organize  their  search  accor- 
ding to  the  model  at  the  expense  of  listing  details,  while  less  able  students 
showed  the  opposite  pattern. 

A final  experiment  used  a film  model  to  demonstrate  the  sort  of  folding 
and  unfolding  of  three  dimensional  objects  often  found  in  measures  of  G^. 

Pre  and  posttests  were  two  kinds  of  Thurstone  G^  tests.  Two  other  pre  measures 
represented  Gc  (grade  point  average  in  language  and  mathematics  courses) . 

Modeling  seemed  to  improve  Gy  for  all  students,  with  no  ATI.  But  language 
grade  interacted;  it  was  positively  related  to  the  Gy  outcome  measure  in  a 
control  group  and  negatively  related  in  the  model  group.  Salomon  suggested 
that  the  normally  logical,  sequential  processing  of  high  verbal  students 
might  be  disrupted  by  attempts  to  use  the  spatial  operations  given  by  the  model. 
It  is  not  clear  why  mathematics  grade  would  not  have  given  the  same  result, 
except  that  higher-level  mathematical  ability  (and  presumably  also  mathematics 
achievement)  has  been  found  related  to  both  Gc  and  G^  (i.e.,  it  may  be  close 
to  Gf).  For  a review  of  G -mathematics  relations,  see  Aiken  (1971). 

The  work  of  Greeno  and  Mayer  serves  several  task  analytic  purposes  in 
instructional  research  in  addition  to  the  interest  at  hand.  In  one  study, 

Mayer  and  Greeno  (1972)  compared  a rational-conceptual  method  with  an  algorith- 
mic formula  method  for  teaching  binomial  probability  by  CAI  to  college  students. 
They  obtained  ATI  using  a pretest  on  probability  and  permutations,  but  not  with 
a mathematical  ability  test  or  with  self-reported  SAT-Q  score  as  aptitudes. 
Students  with  high  pretest  scores  did  better  with  concepts,  and  worse  with  the 
algorithms  than  did  students  with  low  pretest  scores.  Apparently,  the  more 
rational  treatment  required  knowledge  of  prerequisite  probability  concepts  and 
permutations,  hence  was  best  for  Highs.  The  algorithmic  approach  did  not  require 
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these  prior  structures  and  so  was  better  for  Lows.  It  can  be  hypothesized 
further  that  Highs  did  poorly  with  algorithms  because  the  latter  did  not  fit 
the  modes  of  operation  preferred  by  these  students,  or  because  they  were  not 
rationalized  as  meaningful  in  relation  to  students'  prior  concepts.  As  in 
some  other  studies,  a kind  of  learning  hierarchy  may  be  implied  by  this  ATI 
finding,  where  alternative  treatments  are  best  at  different  stages  of  knowledge 
acquisition . 

The  authors  went  on  to  suggest  that  the  two  treatments  produce  different 
structural  outcomes.  The  conceptual  condition  evidently  produced  a cognitive 
structure  with  good  external  connectedness  (with  components  easily  separable 
for  use  in  problem  solving).  The  formula  condition  produced  good  internal 
connectedness  (easier  application  of  the  concept  as  a whole). 

Egan  and  Greeno  (1973)  then  compared  a rule  method  with  a discovery  method 
in  a similar  situation.  Again,  specific  pretests  and  SAT-Q  were  the  aptitudes. 
The  rule  group  was  taught  the  formula  for  combining  binomial  probabilities 
while  the  discovery  group  had  to  infer  it  from  examples.  A similar,  though 
ordinal,  ATI  occurred.  The  posttest  had  strong  relation  to  aptitude  (both 
specific  and  general)  in  the  discovery  group  and  almost  none  in  the  rule  group, 
so  discovery  was  particularly  disadvantageous  for  Lows.  Again,  structural 
differences  in  outcome  were  noted.  Discovery  learning  was  characterized  as 
producing  external  connectedness  similar  to  the  outcome  for  conceptual  learning 
in  the  earlier  study.  This  was  interpreted  as  supporting  Gagne's  view  that 
the  learning  of  meaningful  concepts  and  the  discovery  of  principles  result  in 
similar  outcomes.  Rule  learning  gave  better  internal  connectedness,  as  did  the 
earlier  formula  treatment. 

A further  distinctive  feature  of  this  study  was  the  use  of  a multifaceted 
transfer  measure,  with  problems  stated  in  both  words  and  symbols,  and  of  both 
near  and  far  transfer  (Luchins)  types.  There  were  interactions  between  apti- 
tude and  type  of  problem.  The  combination  of  aptitudes  related  to  performance 
on  all  kinds  of  problems,  but  related  most  strongly  to  the  far  transfer-Luchins 
problems.  As  many  other  studies  'nave  suggested,  regression  slopes  were  steeper 
in  discovery  than  in  rule  conditions. 

A smaller  study  was  also  reported  here  using  programmed  booklets  instead 
of  CAI . Specific  pretest  gave  roughly  similar  ATI  trends  as  in  the  main  study, 
but  the  more  general  aptitude  tests  did  not  interact. 
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Mayer,  Stiehl,  and  Greeno  (1975)  have  produced  extensive  further  work  along 
these  lines.  There  were  four  experiments,  all  with  college  students. 

In  the  first  study  (N=44),  students  received  CAI  instruction  in  binomial 
probability  using  a formula  or  a general  concepts  program.  A crossed  treatment 
variable  concerned  whether  or  not  questions  were  interspersed  in  the  text.  Again, 
a faceted  posttest  allowed  qualitative  distinctions  in  learning.  Aptitudes 
were  self-reported  SAT-Q,  two  specific  pretests  on  probability  and  arithmetic 
concepts,  and  a permutations  test.  There  were  no  main  effects  for  treatment. 
Dividing  students  into  !,ow.  Medium,  and  High  on  each  aptitude  in  turn,  ATI 
results  quite  like  those  of  Egan  and  Greeno  were  obtained.  On  the  probability 
and  permutations  pretests,  the  formula  condition  was  best  for  Lows  while  the 
general  concepts  program  was  best  for  Highs.  The  arithmetic  pretest  gave 
the  same  pattern  for  Highs  but  treatments  did  not  differ  for  Lows.  Self- 
reported  SAT-Q  did  not  interact.  Beyond  its  value  as  replication,  this  study 
is  important  in  another  respect;  it  probed  further  into  the  possibility  that 
aptitude-treatment-posttest  interactions  (ATPI)  might  occur.  If  AT  combinations 
resulted  in  qualitatively  different  learning  outcomes,  then  aptitude  would  be 
relevant  not  only  to  amount  learned  in  a treatment  but  also  to  the  kind  of 
learning  produced  by  a treatment.  However,  ATPI  were  for  the  most  part  non- 
significant. Some  trends  suggested,  for  example,  that  among  medium  aptitude 
students  (on  the  probability  pretest),  the  formula  treatment  produced  higher 
performance  on  questions  stated  in  formula  form  while  the  general  concepts 
treatment  was  better  for  questions  stated  as  stories,  with  no  such  differences 
among  low  and  high  aptitude  students.  Had  such  patterns  occurred  at  other 
levels,  or  more  importantly,  had  different  patterns  occurred  at  different  apti- 
tude levels,  a whole  series  of  important  further  questions  would  arise  in  ATI 
work.  Faceted  posttests  of  the  sort  used  by  Greeno,  Mayer,  and  their  coworkers 
should  become  routine  in  instructional  research, including  ATI  studies.  The 
ATPI  possibility  remains  an  important  question  to  ask,  especially  in  studies 
of  longer  duration. 

The  other  three  studies  of  Mayer,  Stiehl,  and  Greeno  dealt  with  manipu- 
lated prior  experience  rather  than  its  measurement  via  aptitude  tests.  In 
one  (N  • 90),  students  received  no  introductory  problems,  some  introductory 
problems  but  no  feedback,  or  problems  with  feedback  and  correction.  The  same 
instructional  treatments  as  above  were  then  applied.  Again,  main  effects 
were  not  significant,  but  the  interaction  of  prior  experience  and  treatment 
was  significant  and  similar  in  some  respects  to  the  earlier  ATI  findings. 
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Students  without  prior  experience  did  better  with  the  formula  treatment. 

The  treatments  did  not  appear  to  differ  for  experienced  students.  There  were 
also  some  exper ience-treatment-posttest  interactions,  indicating  that  the 
nature  of  instruction  made  some  qualitative  difference  in  outcome  for  students 
with  prior  experience,  but  not  for  those  without  it.  The  other  studies  (N  * 50; 
40)  gave  preinstruction  in  formula  writing  or  in  the  meaning  of  concepts,  or 
both  or  neither,  and  then  placed  all  students  in  a common  instructional  sequence. 

This  was  designed  to  show  the  difficulty  of  different  steps  in  learning.  This 
kind  of  pretraining  manipulation  seemed  to  produce  a set  to  learn  particular 
aspects  of  the  material.  Formula  experience,  for  example,  produced  strong  per- 
formance on  formula  questions  but  poor  performance  on  story  problems,  while  no 
experience  or  experience  in  both  treatments  gave  better  results  than  did 
formula  experience,  particularly  on  story  problems.  General  concept  experience 
alone  was  best  for  story  problems. 

From  all  this, the  authors  draw  the  following  three-part  conclusion: 

1)  ATI  results  suggest  that  formula  instruction  should  be  given  to  low 
aptitude  students,  to  increase  total  amount  learned. 

2)  If,  however,  ATI  arises  from  fairly  specific  kinds  of  prior  knowledge, 
a better  policy  would  be  to  provide  low  students  with  training  to  remedy  this. 

3)  This  is  so  because,  while  the  formula  treatment  produces  skillful 
performance,  it  does  not  produce  more  general  understanding,  and  so  does  not 
prepare  the  low  aptitude  student  for  later  learning. 

This  reasoning  is  sensible  enough  as  far  as  it  goes.  It  emphasizes 
the  important  role  that  prior  achievement  may  play  as  aptitude,  a point 
also  made  by  Tobias  (1976)  in  a review  of  other  such  studies.  And,  it 
harks  back  to  the  Brownell  and  Moser  (1949)  results  (discussed  by  Cronbach 
and  Snow).  In  a multi-school  comparison  of  rote  vs.  meaningful  arithmetic 
instruction,  they  found  that  some  third-graders  seemed  unable  to  profit 
from  explanations.  Their  prior  rote  training  had  apparently  developed  an 
inaptitude  for  later  meaningful  instruction.  Thus,  short-run,  expedient 
decisions  about  alternative  treatments  may  be  detrimental;  one  needs  to 
take  a longer  view. 

But  a longer  view  involves  at  least  three  other  issues,  and  the  present 
results  do  not  address  them.  First,  learning  time  in  the  Mayer  and  Greeno 
experiments  is  measured  in  minutes  while  school  learning  is  measured  in 
months.  In  school  learning,  it  is  important  to  ask:  How  much  prior  training 
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will  be  required  for  students  low  initially  on  a specific  achievement  pre- 
test? How  far  will  this  training  carry  them  into  the  next  block  of  instruc- 
tion? How  far  behind  other  students  will  they  be  as  a result?  When  specific 
inaptitudes  can  be  made  up  easily,  without  other  costs,  that  is  clearly  the 
best  line  for  the  educator  to  take  in  applying  ATI  data.  But  if  remedial 
instruction  becomes  the  predominant  mode  for  some  students,  other  kinds  of 
alternate  treatments  need  to  be  considered. 

Second,  and  perhaps  more  importantly,  one  should  ask:  Do  not  both 

high  and  low  students  initially  need  ultimately  to  have  both  kinds  of  instruc- 
tion? An  old  study  by  Edgerton  (1956;  see  Cronbach  and  Snow)  suggested 
just  this.  A 14-week  course  for  weather  observers  was  taught  in  two  alterna- 
tive sequences;  theoretical  explanations  first,  then  practical  techniques,  or 
the  opposite.  On  a faceted  posttest  much  like  those  used  by  Greeno  and  Mayer, 
items  could  be  divided  into  "How"  questions  and  "Why"  questions.  On  "How” 
questions,  the  theory-first  treatment  was  especially  beneficial  to  students 
low  on  numerical  ability  (G^?).  This  treatment  had  apparently  given  these 
students  concepts  that  would  help  them  when  they  reached  computation.  On 
"Why"  questions,  the  techniques-f irst  treatment  was  better  for  students  low 
on  reasoning  ability  (G^?).  They  were  allowed  to  master  computation  to  help 
when  conceptual  reasoning  was  required.  The  implication  is  that  complex 
instruction  should  start  with  the  treatment  that  avoids  each  student's  weak- 
ness, even  if  both  treatments  need  ultimately  to  be  given.  Another  Edgerton 
(1958)  study  compared  rote  vs.  meaningful  explanation  in  an  aviation  mechanics 

course.  As  it  turned  out,  C related  more  strongly  to  performance  in  the  rote 
treatment  than  in  the  meaningful  treatment.  Apparently,  explanation  helped 
Lows  while  Highs  generated  their  own  explanations. 

Third,  tests  of  specific  prior  knowledge  must  be  connected  to  the  more 
general  fabric  of  individual  differences.  One  cannot  study  mathematical  ability 
and  achievement  in  isolation  just  because  it  appears  to  be  more  directly 
relevant.  Ordinarily,  all  measures  of  prior  knowledge  including  specific 
pretests  can  be  thought  of  as  representing  Gc>  It  would  be  convenient  to 
do  so  here,  but  for  the  apparent  fact  that  the  ATI  findings  came  mostly 
from  the  specific  pretests  rather  than  the  general  tests.  However,  the 
four  aptitude  tests  used  were  undoubtedly  intercorrelated  and  one  can 
speculate  that  ATI  was  mainly  associated  with  the  principal  component  of 
these.  Hence  the  simplest  interpretation  of  the  results  should  rest  on 
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G G.,  until  future  research  shows  conclusively  that  general  constructs 
c I 

cannot  account  for  specific  ATI.  As  noted  earlier,  mathematical  abilities 

often  seem  to  combine  G , G,..  and  G . 

c f v 

The  most  recent  work  in  this  series  comes  from  Mayer  (1975).  He  taught 
computer  programming  to  undergraduates  using  a diagrammatic  model  of  a compu- 
ter expressed  in  familiar  terms  as  one  treatment,  and  a condition  using  a rule 
text  with  examples  but  no  overarcing  framework  or  model  as  the  other.  Three 
studies  were  reported,  each  using  extremely  brief  instructional  booklets. 

The  first  study  (N=86)  compared  two  model  and  two  rule  treatments,  one 
of  each  including  an  introduction  to  flowchart  symbols.  Aptitudes  were  SAT-Q, 
and  tests  on  algebra  computation,  algebra  word  problems,  card  trick  problems, 
and  permutation  ordering.  The  posttest  required  generation  and  interpretation 
of  programs,  with  and  without  loops.  A main  effect  favored  the  model.  It  was 
not  clear  Inw  ATI  was  tested;  presumably  separate  analyses  treated  aptitude  as 
a two-level  blocked  factor,  since  later  studies  did  this.  Mayer  reported 
there  were  no  ATI.  SAT-Q  and  the  algebra  tests  did,  however,  give  substantial 
correlations  with  various  facets  of  the  outcome  measure. 

In  the  second  study  (N=40) , aptitude  was  represented  in  the  design  by 
SAT-Q,  split  into  high  and  low  ability  blocks  at  score  560.  Again,  two  rule 
and  two  model  groups  were  compared,  one  of  each  including  an  introductory 
program.  There  were  two  program  learning  exercises.  An  additional  outcome 
measure  was  a transfer  set  of  programming  exercises.  There  was  no  main  effect 
due  to  the  model,  but  Lows  made  fewer  errors  after  instruction  with  the  model 
while  Highs  were  better  off  without  the  model.  This  pattern  occurred  in  both 
learning  exercises  as  well  as  the  transfer  exercises.  (The  trend  was  not 
statistically  significant  on  transfer,  but  a small  N design  with  aptitude 
blocked  does  not  provide  a powerful  test.) 

The  third  study  (N  » 56)  used  essentially  the  same  design  and  measures  but 
crossed  rat' tel  vs.  rule  text  with  two  types  of  practice  (interpretation  vs. 
generation  oi  programs).  The  same  ATI  was  obtained.  Also,  the  three-way  inter- 
action was  noteworthy;  while  the  rule  treatment  was  best  for  Highs  and  the 
model  treatment  was  best  for  Lows,  in  the  generation  practice  condition,  the 
rule  treatment  was  best  for  both  ability  levels  in  interpretation  practice. 

Mayer  concluded  that  high  ability  students  may  be  hurt  by  the  model  because 
they  already  have  their  own  idiosyncratic  "models",  in  the  form  of  rich  prior 
knowledge.  Low  ability  students  seem  to  experience  no  such  interference. 
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All  these  studies  hint  that  treatments  based  on  simplified  models,  algor- 
ithms, direct  rules  to  follow,  etc.,  may  be  detrimental  to  High  G students, 
even  while  they  help  Low  G students.  This  is  a special  case  of  the  general 
hypothesis  stated  earlier  about  the  locus  of  the  information-processing  burden, 
student  or  treatment.  It  is  an  especially  interesting  case  because  it  implies 
negative  effects  on  High  G students.  Some  kind  of  interference  phenomenon, 
operating  between  what  the  model  tries  to  help  students  do  and  what  students 
would  ordinarily  do  on  their  own,  seems  to  be  responsible.  But  the  sources 
of  this  interference  cannot  be  pinned  down  more  exactly  at  present.  Nor  is 
it  clear  that  the  phenomenon  is  more  associated  with  G^ , as  might  have  been 
hypothesized  following  >ne  Salomon  study.  All  of  the  studies  show  the  effect 

with  G . One  of  Salomon's  studies  failed  to  show  it  with  G . But  none  of 
c v 

these  studies  effectively  distinguishes  G and  G,,  or  any  of  these  general 

c f 

constructs  from  the  special  pretests  preferred  in  some  studies.  Such  dis- 
tinctions will  be  important  in  future  research;  sharpening  the  aptitude  con- 
structs at  work  in  ATI  where  Highs  are  hindered  by  a model  or  algorithm 
should  help  define  the  interference  mechanisms  that  produce  such  results. 

Two  other  studies  bear  more  directly  on  the  G G,  distinction.  But  thev 

C r J 

fail  to  make  it  clear. 

Skanes  et_  al_  (1974)  used  a brief  treatment  to  explain  and  give  practice 
in  solving  problems.  Over  2000  students  in  Grades  5 to  9 participated.  Some 
students  received  practice  on  letter  series,  some  on  number  series.  These 
groups  were  further  subdivided,  to  form  pretest  and  no-pretest  groups.  A 
fifth  group  served  as  control.  Pretest  and  posttest  were  forms  on  Thurstone's 
Letter  Series  Test.  Aptitudes  were  the  Raven  and  Otis  mental  tests,  the  first 
of  which  is  often  considered  as  representing  G^.,  the  second  as  G or  G^ . Mul- 
tiple regression  analysis  showed  two  significant  ATI.  It  appeared  that 
students  high  on  Otis  were  helped  by  having  the  pretest  while  students  low 
on  Otis  were  better  off  without  it.  Also,  students  low  on  Raven  were  helped 
by  direct  training  on  letters  while  those  high  on  Raven  derived  more  benefit 
from  the  indirect  training  using  numbers.  The  analysis  did  not  distinguish 

G,  and  G satisfactorily;  Otis  and  Raven  were  undoubtedly  correlated.  Thus 
f c 

the  results  must  be  attributed  for  the  time  being  to  an  undifferentiated  G. 

If  the  direct  vs.  indirect  variation  could  be  shown  to  interact  specifically 
with  Gf,  it  would  imply  that  the  information-processing  burden  hypothesis 
discussed  previously  applies  to  G^  as  well  as  to  G^. 
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Christ -Whitzel  and  Hawley-Winne  (1976)  conducted  a year-long  evaluation  of 
individually  prescribed  instruction  (IPI)  in  mathematics,  using  124  sixth  graders. 
Treatments  were  IPI,  a mathematics  management  and  support  system  which  related 
different  objectives  to  available  curriculum  materials,  and  provided  individual 
and  small-group  as  well  as  whole-class  teaching,  and  a traditional  basal  text 
with  whole  class  teaching.  Each  treatment  was  applied  in  three  classes  of  a 
different  school.  Two  forms  of  the  Comprehensive  Test  of  Basic  Skills  served 
as  pre-  and  posttest,  administered  in  Spring  of  the  prior  year  and  in  Spring  of 
the  treatment  year.  This  gave  a total  score  and  subscores  for  computation,  con- 
cepts, and  applications.  While  these  were  used  as  separate  outcome  measures, 
the  total  score  alone  was  used  as  aptitude.  Also,  an  IQ  score  available  at 
pretest  correlated  highly  with  the  pretest  scores,  so  it  was  not  included.  The 

pretest  total  score  can  probably  be  thought  of  as  G . Other  aptitudes  were 

c 

Embedded  Figures  (G^)  and  questionnaires  giving  several  self-concept  and  locus- 
of-control  scores. 

Multiple  regression  analyses  showed  G^  accounting  for  most  of  the  variance 
in  each  outcome  measure,  with  G^  adding  significantly  each  time.  Treatment 
main  effects  showed  traditional  instruction  highest  and  IPI  lowest,  but  these 
differences  also  existed  at  pretest  so  no  average  conclusion  is  justified.  ATI 
appeared  for  both  G^  and  G^.,  and  they  seemed  to  run  in  opposite  directions. 

For  total  outcome  and  computation.  Low  C students  were  better  served  by  IPI 
while  High  G students  did  better  in  traditional  classrooms.  After  extracting 
this  interaction,  however.  High  G{  students  seemed  to  do  better  in  IPI'.  For 
the  other  two  outcomes  these  aptitudes  continued  to  show  main  effects.  The 
ATI  trends  were  as  before  but  very  faint.  Self  concept  showed  one  other  ATI; 
on  the  concepts  subtest,  the  curriculum  management  treatment  was  best  for 
Highs,  while  the  other  treatments  were  best  for  Lows. 

The  analysis  is  an  Interesting  demonstration  of  ATI  used  in  evaluation,  but 
it  does  not  go  far  enough  for  the  present  purposes.  Without  knowing  correlations 
among  all  the  pretest  measures,  or  sharply  distinguising  Gf  and  Gf  as  predictors, 
the  implication  is  not  fully  tested.  An  important  hypothesis,  however,  for  re- 
analysis and  for  further  research,  is  that  individualization  (via  IPI)  helps 
those  who  have  not  learned  well  in  prior  instruction  but  who  have  high  fluid 
ability  ("underachievers"?);  It  is  less  effective  for  those  high  in  prior 


achievement  and/or  low  in  fluid  ability.  This  result  would  replicate  the  old 
Anderson  (1941)  study,  which  gave  a similar  implication  in  a comparison  of  drill 
vs.  meaningful  instruction.  As  will  be  clear  in  the  methodology  section  to  come, 
however,  the  Anderson  finding  does  not  withstand  reanalysis. 

This  section  can  be  closed  with  a series  of  studies  dealing  with  the  decep- 
tively simple  distinction  between  Gc  and  Gv<  The  most  popular  special  ability 
hypothesis  in  ATI  research  has  been  that  a verbal  vs.  visual  distinction  among 
abilities  should  interact  with  similar  distinctions  in  instructional  materials. 

The  simplest  hypothesis  was  that  High  G -Low  G students  would  excel  in  visual 

v c 

simplest  hypothesis  was  that  High  G^-Low  G^  students  would  excel  in  visual 

treatments  while  Low  G -High  G students  needed  verbal  treatments.  The  Cronbach- 

v c 

Snow  review  showed  much  inconsistency  in  attempts  to  verify  this  (see  Table  2) 
and  suggested  that  the  hypothesis  might  often  be  stated  in  the  opposite  way. 
Visual  treatments  might  at  times  compensate  for  Low  G^;  verbal  treatments  might 
compensate  for  Low  G . Difficulty  of  the  material  would  be  one  important  moder- 
ating factor  in  such  hypotheses.  Also,  general  vs.  special  ability  interpre- 
tations needed  to  be  clarified. 

Gustafsson  (1974;  1976)  has  pursued  this  problem  in  three  studies  done  in 
Sweden.  The  work  includes  a thorough  rethinking  of  ATI  in  this  domain,  and 
a methodological  comparison  of  various  regression  and  blocked  anova  approaches. 
The  first  study  (N=316  seventh  graders)  used  a 9-page  text  on  the  polar  lights 
phenomenon  as  a verbal  treatment,  with  a slightly  reduced  text  plus  17  illus- 
trations (16  pages)  as  a pictorial  treatment.  Aptitudes  were  Opposites  (G^) , 
Number  Series  (G^.?),  and  Metal  Folding  (G^)  , plus  some  interest  scales.  Out- 
comes were  short-answer  and  essay  tests  administered  directly  after  instruction. 

G showed  stronger  relation  to  achievement  in  the  pictorial  treatment,  while  G 
v c 

was  more  highly  related  to  performance  in  the  verbal  treatment,  particularly 
on  the  essay  outcome.  Number  Series  gave  results  rather  like  those  of  G^. 

This  finding,  then,  demonstrates  the  traditional  expectation.  But  some  special 
concerns  about  the  study  led  to  a reanalysis  (summarized  in  Gustafsson,  1976). 
This  allowed  quadratic  terms  and  also  checked  some  effects  by  blocking  on  apti- 
tude. The  results  for  the  essay  test  remained  essentially  unchanged;  verbal 
instruction  was  best  for  students  low  on  G^  and  high  on  G^  while  pictorial 
instruction  was  best  for  students  high  on  G^  and  low  on  G^ . But  this  effect  had 
to  be  attributed  in  part  to  interference  from  some  association  tests  which  had 
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been  administered  during  instruction  in  some  groups;  they  apparently  had  had  a 
negative  effect  on  the  reading  strategies  of  High  G -Low  G^  students  and  could 
have  accentuated  the  ATI.  Results  for  other  criteria  were  modified  in  a complex 
way.  Apparently,  verbal  instruction  served  High  G^-Low  G^.  students  best  on  the 
short  answer  test.  A complex  three-way  interaction  among  the  aptitudes  was 
regarded  by  the  author  as  due  to  measurement  problems.  Finally,  for  those  short- 
answer  items  that  dealt  directly  with  the  illustrated  material,  pictorial  instruc 

tion  was  best  for  Low  G students  and  for  Medium  G students  who  were  high  on  G . 

c c v 

This  was  apparently  regarded  as  the  most  trustworthy  of  the  results. 

Gustaffson's  (1976)  second  study  (N=201  seventh  graders)  sought  to  replicate 
these  findings  using  the  same  instructional  material  and  the  same  three  aptitudes 
The  short-answer  test  was  again  divided  into  items  bearing  on  illustrated  vs. 
noni llustrated  material.  This  plus  the  essay  test  were  administered  as  immediate 
criteria  and  again  eight  days  later  as  retention  measures.  The  pictorial  treat- 
ment produced  somewhat  higher  achievement  on  all  criteria.  Overall,  ATI  were 
not  significant.  However,  the  score  for  illustrated  short-answer  items  gave 
steeper  regression  slopes  on  G^  in  the  verbal  treatment  than  in  the  pictorial 
treatment,  so  students  low  on  G did  best  with  pictures.  This  is  consistent 
with  the  reanalyzed  results  of  the  first  study.  The  effect  reached  borderline 
significance  at  retention.  As  in  the  first  study,  G showed  a tendency  to 
augment  this  effect,  so  that  the  best  performances  in  the  pictorial  treatment 
among  students  low  on  G^  occurred  among  those  also  high  on  G^.  The  results  for 
essay  were  mixed  and  failed  to  conform  to  earlier  findings. 

A third  study  (N=229)  used  new  instructional  materials  on  the  heart  and 
blood  circulation  system.  Again,  brief  text  materials  were  presented  in 
illustrated  and  unillustrated  versions.  In  contrast  to  previous  studies,  the 
subject-matter  was  especially  designed  to  be  "spatially  demanding",  on  the 
hypothesis  that  Low  G^  students  would  thus  profit  most  from  pictures.  Im- 
mediate outcome  measures  yielded  three  scores:  verbal  content  achievement, 

verbal  achievment  on  spatially  demanding  content  (called  "spatial"),  and 
pictorial  content  achievement.  Aptitude  tests  used  and  factor  analytic  results 
for  them  are  shown  in  Table  3.  Gustafsson  chose  to  use  simple  aptitude 
scores  in  some  ATI  analyses,  factor  scores  in  others, and  some  combinations 
of  simple  scores  in  still  others. 


Table  3 about  here 
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Table  3 


Varimax  Rotation  Results  from  Factor  Analyses  Reported  by 
Gustaffson  (1976),  Satterly  (1976),  and  Das  and  Molloy  (1975) 


Tests3 

G G, 

c f 

G 

V 

PA  PS  M 

Gustaffson 

Opposites 

.83* 

.16 

.07 

Seventh  Graders 

Reading  Speed 

.80* 

-.04 

.15 

(N=229) 

Number  Series 

.66* 

.37* 

.09 

Metal  Folding 

• .42* 

.53* 

.25 

Figures 

.15 

.82* 

.01 

Hands 

.03  ' 

.77* 

.01 

Word  Pair  Associates 

.21 

-.01 

.88* 

Picture  Pair  Associates 

.05 

.12 

.90* 

Sacter ly 

Verbal  Intelligence 

.76* 

.26 

-.08 

.05 

Fifth-Sixth  Graders 

English  Comprehension 

.8  1* 

-.07 

.27 

.03 

(N=201) 

Picture  Vocabulary 

.79* 

-.01 

.38* 

.16 

Mathematics  Attainment 

.67* 

.30* 

.09 

.04 

Embedded  Figures 

.31* 

.77* 

-.04 

.13 

Haptic  Perception 

. 32* 

.48* 

.23 

.18 

Analytic  Preference 

-.04 

.64* 

.10 

-.18 

Gottschaldt  Figures 

-.02 

.48* 

.52* 

.25 

Spatial  Judgment 

.34* 

.13 

.65* 

.25 

Spatial  Ability 

.24 

.03 

.87* 

-.08 

Perceptual  Speed 

. 1 1 

.01 

.10 

.93 

Das  and  Molloy  h 

Verbal  IQ 

. r>y* 

.08 

-.16 

-.12 

Fourth  Graders 

Backward  Digit  Span 

. 76* 

.28 

-.03 

.14 

(N’=60) 

Bridge  Task 

-.  54* 

. 34* 

.04 

-.  18 

Performance  IQ 

.22 

.57* 

.08 

-.23 

Raven  Matrices 

-.01 

.86* 

-.02 

-.02 

Figure  Copying 

.07 

.75* 

.18 

.00 

Memory  for  Designs  (Errors  ) 

-.03 

-.70* 

.03 

.07 

Cross-Modal  Coding 

.01 

.64* 

-.11 

.14 

Word  Reading 

.03 

.04 

.84* 

-.14 

Color  Naming 

-.16 

.06 

.78* 

.01 

Forward  Digit  Span 

-.14 

.04 

.00 

.80* 

Visual  Short  Term  Memory 

. 14 

-.08 

-.22 

.67* 

Serial  Recall 

.05 

-.07 

.01 

.93* 

Free  Recall 

.08 

.00 

-.01 

.90* 

Notes : 


* Factor  loading  exceeds  .30. 

a Published  Test  Lists  have  been  reordered  here  to  place  similar  tests  together, 
b In  the  Das-Molloy  List,  two  variables  defining  an  SES  factor  have  been  omitted. 
PA  = Paired  associates  learning  factor. 

PS  ” Perceptual  speed  factor. 

M • Short  term  memory  or  memory  span  factor. 
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Various  analyses  failed  to  show  substantial  ATI.  There  was  a tendency 
toward  curvilinear  regression,  with  the  pictorial  treatment  giving  best  results 
on  the  spatial  criterion  for  students  scoring  either  low  or  high  on  the  Figures 
Test  (Gv)  . But  this  trend  also  seemed  present  for  the  verbal  criterion!  In 
general,  Gustafsson  concluded  that  for  verbal  and  spatial  criteria  there  were 
no  ATI.  For  the  pictorial  criterion,  there  was  ordinal  interaction;  the 
pictorial  treatment  was  best  for  everyone  but  least  so  for  students  high  on 
and  low  on  C^.  This  is  contrary  to  the  hypothesis,  and  it  does  not  seem 
to  match  up  with  the  results  of  the  earlier  studies.  The  conflicting  character 
of  these  studies  is  typical  of  research  on  the  verbal  vs.  visual  contrast,  as 
noted  earlier. 

Table  3 has  been  constructed  to  illustrate  a point  also  alluded  to  by  Gus- 
tafsson. Number  Series  and  Metal  Folding  had  correlations  with  members  of  both 

sets  of  tests,  and  the  factor  analysis  split  them  between  G and  G . This  seerr 

c v 

to  be  a typical  result,  not  specific  to  these  data.  One  might  conclude  that 

performance  on  these  tests  requires  both  G and  G , or  that  their  items  can  be 

c v 

solved  using  either  ability,  or  that  they  represent  abilities  involved  in  the 
other  two  abilities,  namely  G or  G^..  This  latter  possibility  is  often  lost  in 
such  analyses;  it  might  have  been  made  to  appear  in  the  Gustafsson  data  by  the 
addition  of  measures  such  as  the  Raven  or  Wechsler  Performance  IQ,  or  by  a fac- 
tor solution  closer  to  the  unrotated  axes  than  the  varimax  procedure  allows. 

Data  from  two  other  studies  are  shown  in  Table  3.  Satterly  (1976  sought 
to  distinguish  the  cognitive  style  called  "field  independence"  from  G and  then 
relate  it  to  mathematics  achievement.  This  style  construct  has  process  impli- 
cations, so  understanding  might  be  advanced  thereby.  But  his  choice  of  reference 
measures  produced  only  G^  and  G , so  his  style  factor  cannot  be  distinguished 
from  Gj  and  is  so  labeled  here.  As  in  the  Gustafsson  data,  several  measures 
including  the  mathematics  score  may  represent  either  or  both  of  two  abilities, 
or  a more  basic  one;  measures  such  as  Raven  or  Wechsler  Performance  IQ  are  again 
needed.  The  Das-Molloy  research  (1975;  see  also  Das,  Kirby,  & Jarman,  1975) 
represents  a unique  attempt  to  reinterpret  ability  factors  in  process  terms, 
using  Lurla's  distinction  between  simultaneous  and  successive  modes  of  informa- 
tion Integration  in  cognition.  They  interpret  what  is  here  labeled  G^  as  simul- 
taneous because  it  includes  several  tasks  that  involve  spatial-constructive  or 
multisensory  performances,  in  addition  to  the  traditional  G^  markers.  This  is 
a provocative  suggestion  for  research,  but  not  an  argument  for  creating  new 
factor  labels.  The  Bridge  Task  in  particular  may  signal  an  important  direction 
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for  such  research,  with  its  negative  loading  on  G^.  But  what  is  here  called 
was  interpreted  by  Das  and  Molloy  as  proficiency  in  spatial-imagery.  There  is 
no  Justification  for  this  in  the  data,  and  no  G^  measures  were  included  in  the 
battery.  The  factor  called  "successive"  must  be  listed  here  as  short  term  mem- 
ory; its  tasks  are  common  memory  measures,  which  are  admittedly  serial  in  character. 

Thus,  Table  3 illustrates  the  problems  endemic  to  much  of  differential  psy- 
chology. Aptitude  factors  can  be  extremely  useful  as  reference  points  for  sort- 
ing individual  differences.  But  alone  they  can  never  be  interpreted  as  unities, 
or  yield  process  theories,  or  sustain  new  interpretations.  Correlational  analysis 
alone  will  never  solve  these  problems.  What  is  required  are  process  descriptions 
of  how  students  solve  the  problems  represented  in  different  tests,  and  this  will 
require  introspective  interviews  as  well  as  much  experimental  analysis.  It  may 
be  found,  for  example,  that  Metal  Folding  problems  are  solved  by  some  students 
using  visualized  rotations,  and  by  others  using  analytic  reasoning.  This  is  the 
implication  of  early  research  by  French  (1965).  Until  aptitudes  as  well  as  treat- 
ments can  be  described  in  such  terms,  inconsistent  ATI  results,  and  consistent 
results  as  well,  will  not  be  satisfactorily  understood. 

In  sum,  work  on  the  G GrG  constructs  is  insufficient  to  justify  their 

c f v J J 

distinction  in  ATI  research.  The  recommendation  is  not  to  dispense  with  the 
distinction,  but  rather  to  sharpen  the  analytic  attacks  in  both  instructional 
and  laboratory  research.  No  ATI  study  has  yet  measured  the  three  adequately. 

At  the  same  time,  the  correlates  of  each  should  be  sought  in  the  broader  domain 
of  differential  psychology.  No  research  that  includes  a mental  measure  studies 
that  variable  in  isolation.  The  personal,  motivational,  and  social  correlates 
of  these  constructs  may  help  to  distinguish  them  from  one  another  and  to  connect  them 
to  other  instructionally  relevant  complexes,  such  as  A.A^A^.  A long  line  of 
research  has  sought  to  accomplish  this  for  the  verbal  vs.  quantitative  ability 
distinction  (see  McCarthy,  1975).  The  same  is  required  for  GcGfGv’ 

In  the  meantime,  the  in terpre tation  of  most  cognitive  tests  in  ATI  rests 
on  G.  And,  for  the  sake  of  parsimony,  any  study  investigating  a more  specialized 
aptitude  must  show  its  separation  from  G empirically.  The  only  other  cognitive 
abilitv  constructs  thus  distinguishable,  that  also  have  had  some  place  in  ATI 
research,  are  the  other  factors  listed  in  Table  3;  memory  measures,  perceptual 
speed,  and  some  specific  learning  performance  measures.  Memory  tests,  in 
particular,  have  occasionally  shown  distinct  interactions.  These  will  be 
touched  on  in  the  final  section  of  this  review. 
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Methodological  Developments 

Cronbach  and  Snow  argued  that  tests  of  ATI  hypotheses  would  have  to 
be  designed  as  large-scale,  long-duration,  real  school  studies.  Only  in 
this  way  could  stability  of  aptitude  and  treatment  effects  be  achieved,  with 
sample  sizes  sufficiently  large  for  powerful  statistical  tests,  in  settings 
likely  to  lead  to  practically  important  conclusions.  Studies  on  this  scale 
are  expensive  and  time-consuming,  so  they  occur  relatively  infrequently. 

But  they  are  increasingly  seen  as  worth  the  effort;  the  Peterson  and  Porteus 
studies  step  in  this  direction.  Also,  the  incorporation  of  an  ATI  viewpoint 
into  large-scale  evaluation  studies  has  increased,  and  secondary  analyses  of 
data  from  such  studies  has  helped  to  test  some  ATI  hypotheses  at  minimal 
extra  expense. 

In  all  ATI  work,  multiple  regression  methods  have  been  advocated  as  the 
most  general  and  appropriate  approach  to  data  analysis.  With  new  texts  now 
available  (see  Kerlinger  and  Pedhazur,  1973;  Amick  and  Walberg , ] 97 5 ; Borich, 
Godbout,  and  Wunderlich,  1976),  multiple  regression  methodology  is  coming 
into  routine  use. 

A new  look  at  regression  methodology  in  large  scale  studies,  however,  has 
introduced  some  new  problems  with  potentially  profound  implications  for  future 
research.  These  rroblems  arise  from  methodological  concerns,  but  they  have 
central  substantive  importance.  They  are  introduced  in  this  section,  to  spur 
interest  and  further  development,  but  they  cannot  be  resolved  here. 

Between-class  and  wl thin-class  regression  components.  The  large,  real 
school  studies  advocated  for  ATI  research  must  inevitably  include  multiple  cl.i 
rooms,  with  classes  assigned  to  alternative  instructional  treatments.  There  .-it, 
then  individuals  nested  within  classes  nested  within  treatments.  Virtually  a I 
research  of  this  character  has  ignored  class  groupings,  producing  pooled  re- 
gressions of  outcome  on  aptitude  for  individuals  under  separate  treatments. 

But  this  assumes  that  outcome  is  a function  of  individual  aptitude  only,  while 
class  composition  on  aptitude,  class  average  aptitude,  and  individual  teachers 
have  no  effect.  Theoretically,  one  could  posit  two  kinds  of  aptitude  effects 
in  addition  to  individual  effects:  comparative  effects,  wherein  a student's 

standing  on  aptitude  in  a particular  class  is  important,  if  e.g.,  the  teacher 
pays  primary  attention  to  those  students  above  average  on  some  aptitude  u> 
that  class;  and  class  effects,  wherein  class  average  aptitude  is  important. 
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if,  e.g.,  the  teacher  adopts  a fast  pace  in  a class  with  a high  average  on 
some  aptitude.  The  pooled  individual  regressions  for  treatments  combine,  and 
mask,  both  comparative  effects  and  class-level  effects.  Clearly,  a methodol- 
ogy for  multi-class  studies  is  required  that  separates  between-  and  within- 
class  regressions,  so  that  alternative  causal  explanations  for  ATI  in  any  given 
instance  can  be  examined.  Furthermore,  it  becomes  clear  that  single-class 
experiments  are  unable  to  distinguish  among  these  alternative  sources  of  ATI, 
because  between-class  effects  cannot  be  detected  here.  In  such  studies, 
routine  interpretation  of  ATI  as  bearing  on  individual  phenomena  is  unjustified. 

Cronbach  and  Webb  (1975)  reanalyzed  data  from  an  early  study  by  Anderson 
(1941)  that  had  seemingly  shown  a marked  and  important  ATI.  In  the  Anderson 
study,  18  fourth  grade  classes  were  assigned  to  drill  vs.  meaningful  instruc- 
tion in  arithmetic.  Using  an  achievement  pretest  and  a general  ability  measure 
as  aptitudes,  and  an  achievement  posttest  as  outcome,  Anderson  computed  re- 
gression slopes  at  the  individual  level  ignoring  class  membership,  and  obtained 
ATI.  Students  high  on  prior  achievement  and  low  on  ability  ("overachievers") 
did  best  in  drill,  while  students  low  on  prior  achievement  and  high  on  ability 
("underachievers")  did  best  with  meaningful  instruction.  This  result  would 
be  important  still  today,  implying  as  it  does  that  students  high  on  and  low 
on  should  be  given  drill  instruction,  a method  in  which  they  had  already 
displayed  progress  (drill  being  prevalent  in  arithmetic  instruction  in  the 

late  1930's),  while  students  high  on  G.  and  low  on  G should  be  switched  to 

r c 

the  more  innovative,  meaningful  instruction.  The  Anderson  study  is  one  of  the 

few  large,  real  school  studies  bearing  on  the  hypotheses  discussed  earlier  in 

the  seer  ion  on  the  G G,G  complex. 

c f v r 

The  Cronbach-Webb  reanalysis  (N“434)  estimated  between-c lass  regressions 
to  test  class-level  effects,  and  within-class  regressions  to  examine  comparative 
effects.  This  analysis  does  not  separate  these  and  individual  effects  completely, 
but  does  give  us  hints  as  to  which  effects  are  likely  to  be  worth  further  atten- 
tion. In  brief,  the  individual  ATI  implied  by  the  Anderson  analysis  vanished. 

The  class-by-class  regressions  varied  greatly,  so  that  the  pooled  within-class 
withln-t reatment  regression  slopes  gave  only  a fraction  of  the  trend  reported  by 
Anderson's  analysis.  At  the  between-class  level,  the  small  number  of  class 
means  and  some  statistical  anomalies  of  the  Anderson  sample  made  a conclusion 
for  or  against  ATI  unjustified.  The  reanalysis  thus  wipes  away  the  Anderson 
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study  as  support  for  an  ATI  hypothesis  involving  and  Gf.  More  importantly, 
it  cues  present  researchers  to  attend  to  complexities  in  ATI  research,  and  in 
all  classroom  research,  not  dreamed  of  only  a short  time  ago. 

A contemporary  demonstration  of  this  complexity  comes  from  a study  by 
Greene  (1976).  She  investigated  the  effect  of  giving  students  their  own 
choice  about  when,  for  how  long,  and  in  what  sequence,  they  would  work  on 
booklet  lessons  in  class.  Nine  fourth  and  fifth  grade  classes  were  divided 
randomly  into  choice  and  no-choice  groups.  The  no-choice  groups  worked  on 
booklets  designed  to  teach  the  solving  of  Thurstone  letter  series  problems 
under  teacher  direction  and  scheduling.  The  choice  students  could  schedule 
themselves.  Aptitudes  included  a pretest  on  letter  series  and  Lorge-Thornd ike 
Verbal  IQ.  Outcome  included  a posttest  on  the  workbook. and  a measure  of  intere; 
in  doing  further  similar  work.  Various  motivational  measures  served  as  both 
pre-  and  posttests.  The  study  spanned  four  weeks  of  class  time  with  N=165. 

Figure  5 shows  a representative  result.  Aptitude  here  is  G combining 
Lorge-Thorndike  and  the  letter  series  pretest.  Outcome  is  a factor  combining 
the  letter  series  posttest  and  measures  reflecting  a student's  confidence  about 
school  work  (A').  The  regression  lines  spanning  the  figure  show  the  pooled 
individual  slopes,  for  no-choice  (solid)  and  choice  (dashed)  students.  The 
regressions  identified  at  the  margins  are  between  are  between-c lass  slopes 
fitted  to  the  half-class  means  for  each  treatment.  The  thin  lines  show  within 
half-class  regressions,  drawn  to  + 1 S.D.  around  the  numbered  half-class  means. 


Figure  5 about  here 

Only  the  pooled  individual  slopes  are  based  on  enough  points  to  be  trust- 
worthy. Yet  the  impression  of  within-class  heterogeneity,  and  of  the  diver- 
gence of  between-class  and  pooled  individual  slopes  is  striking.  Attending 
only  to  Individual  data,  there  is  clearly  no  ATI.  The  between-class  slopes, 
however,  suggest  a major  effect.  One  might  say  that  choice  in  organizing  one's 
own  work  should  be  given  to  High  G groups,  while  the  teacher  should  do  the  organ 
izing  for  Low  G groups.  But  this  may  be  so  here  only  because  both  treatments 
exist  within  each  class;  the  between-group  slope  for  choice  may  appear  in  a 
steeper  position,  and  that  for  no-choice  in  a shallower  position,  only  because 
the  social  contrast  is  perceivable  by  the  members  of  each  half-class  group  in 
the  classroom.  It  seems  likely  that  such  a social  contrast  would  work  in  oppo- 
site directions  in  classes  differing  on  average  ability.  That  is,  choice  may 
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Between  class,  no-choice 


Pooled  within  no-choice  treatment 
Pooled  within  choice  treatment 
No-choice  treatment  within  class  i 
Choice  treatment  within  class  i 


Between  class,  choice 


Note 


Sample  range  for  G was  -2.68  to  2.71 
Sample  range  for  A'  was  -2.47  to  1.82 


Between-class , within-class , and  pooled  individual  regression  lines  for 
two  treatments  using  G as  aptitude  and  A'  as  outcome  (after  Greene,  1976) 


be  relatively  beneficial  to  High  G students  when  some  of  their  classmates  are 
seen  to  be  denied  it;  no-choice  may  be  beneficial  to  Low  G students  when  some 
of  their  classmates  are  seen  to  be  cut  adrift.  But  the  data  do  not  really  show 
this  effect  clearly;  the  distribution  of  no-choice  means  appears  restricted  on 
G ; there  seem  to  be  some  outliers,  and  Classes  4 and  8 in  the  middle  ability 
range  show  mean  differences  running  in  opposite  directions.  Besides,  to  test 
the  practical  value  of  such  a result  would  require  a design  in  which  whole  classes 
are  assigned  to  treatment,  since  it  does  no  educational  good  to  push  half  the 
students  up  and  half  down  within  a class  by  such  an  arrangement. 

Some  of  the  within-c lass  slopes  are  based  on  a dozen  or  so  points  and  are 
suggestive  of  further  effects.  Class  6,  a low  average  class,  shows  ATI  consis- 
tent with  the  between-group  slopes;  Low  G students  are  even  better  served  than 
are  High  G students  by  the  no-choice  treatment  within  this  class.  But  Class  7 
results  are  Inconsistent  with  the  between-group  implications.  In  this  high 
average  ability  class,  apparently,  choice  is  best  for  everyone,  but  particularly 
for  lower  ability  students.  Many  other  within-class  inconsistencies  are  notable. 

Conclusions  about  comparative  and  class  effects  are  not  justified  in  the 
Greene  study,  not  in  the  reanalysis  of  the  Anderson  study.  This  work  does  argue 
strongly,  however,  for  the  development  of  research  designs  and  statistical  meth- 
odology capable  of  separating  such  effects.  Cronbach's  (1976)  monograph  takes 
a major  step  in  this  direction. 

Out  1 iers . Both  the  Cronbach-Webb  treatment  of  the  Anderson  data,  and 
the  Greene  data,  show  examples  of  how  regression  slopes  can  be  profoundly 
affected  by  a few  deviant  data  points.  And  the  threat  of  outlier  dis- 
tortions increases  as  regression  slopes  are  based  on  fewer  cases.  This  is 
an  inevitable  problem;  investigators  will  usually  be  limited  to  a dozen  or 
so  classes  per  treatment,  with  about  25  students  per  class.  It  is  a 
problem  as  well  in  all  ATI  research,  whether  laboratory  or  school  based. 

But  what  is  an  outlier?  How  can  one  decide  that  a particular  data  point  is 
unnaturally  deviant?  Statistical  criteria  cannot  serve  alone.  Psychological 
research  requires  substantive  criteria.  In  any  case,  present  methods  are 
often  specialized  and  incompletely  developed.  (See  Klitgaard,  1976,  for  a 
useful  summary.) 

An  example  can  be  used  to  suggest  some  directions  methodological  develop- 
ment might  take.  It  also  serves  to  show  why  studies  of  long  duration  are 
required  to  understand  the  functioning  of  aptitude  in  school  learning. 
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In  an  evaluation  of  a new  curriculum  for  Dutch  first-year  (18  year  old) 
medical  students  (Wijnen  & Snow,  1975),  aptitude-outcome  relations  were  studied 
in  each  of  ten  four-week  instructional  blocks  over  the  year.  Instruction 

in  each  block  proceeded  by  problem-  or  case-oriented,  student-centered, 
small  group  discussion — a radical  <'■  parture  from  the  teacher-directed  lecture 

and  recitation  to  which  Dutch,  and  most  U.S.  secondary  students,  have  long 
been  accustomed.  While  the  obtained  bivariate  scatterplots  would  have  been 
judged  unusual  by  textbook  standards,  they  were  similar  to  some  plots  seen 
previously  by  the  investigators,  so  deviant  points  could  not  be  dismissed  as 
chance  outliers. 

Figure  6a  shows  scores  for  the  49  students  on  a verbal  reasoning  pre- 
test and  an  achievement  test  covering  the  first  four-week  block  of  instruction. 

The  aptitude  analysis  began  with  the  Verbal  Analogies  Test  because  it  yielded 
the  strongest  simple  relation  to  outcome  among  four  ability  tests  and  came 
closest  of  these  conceptually  to  the  definition  of  G.  The  regression  line 
shown  (corresponding  to  r=0.48)  is  fitted  to  all  points  taken  together. 

The  abnormality  among  high  ability  students  is  marked;  it  appears  as  if  two 
bivariate  distributions,  running  in  contrary  directions,  have  overlapped  to 
reduce  the  overall  regression. 

Figure  6abc  here 

An  informal  analysis  was  constructed  to  identify  underlying  variables 
that  might  account  for  the  results.  The  approach  taken  was  related  to  one 
proposed  by  Marks  ( 1964  ),  and  called  by  him  "off-quadrant  analysis".  In 
effect,  the  criterion  adopted  to  judge  outliers  was  one  of  predictability; 
points  whose  deviance  in  an  aptitude-outcome  scatterplot  can  be  predicted  by 
psychological  variables  that  make  sense  in  context  are  not  outliers.  It  was 
hoped  that  the  method  would  be  useful  in  identifying  aptitude  complexes  worthy 
of  more  direct  investigation. 

The  informal  analysis  worked  with  rules  for  fitting  ellipses  and  partition- 
ing the  scatterplots  into  area  groups,  labelled  by  letter  as  shown  in  Figure  6a. 
Then,  other  aptitude  information  (in  the  form  of  personality  scales  admin- 
istered at  pretest)  was  used  to  identify  variables  associated  with  the  B.  to 

2 1 

continuum,  as  well  as  with  other  area  group  contrasts.  Among  other — 

2 

This  procedure  was  developed  by  hand,  using  subjective  judgment.  A report 
on  the  method  and  analysis  of  the  present  data  is  in  preparation.  While 
formal  statistical  procedures  are  needed  to  replace  this  informal  method, 
that  step  awaits  the  completion  of  later  phases  of  this  multi-year  evaluation 
study,  when  replicates  of  the  present  scatterplot  will  be  available.  The 
purpose  here  is  only  to  show  the  initial  methodological  point,  not  to  draw 
conclusions. 
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Figure  6.  Scatterplots  for  three  instructional  blocks  using  analogies  test  scores 
as  aptitude  and  block  achievement  tests  1^  (a),  1^  (b),  and  I,.  (c) 
as  outcomes  (after  Wijnen  & Snow,  1975). 
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findings,  students  in  and  described  themselves  are  more  independent  and 
more  motivated  to  achieve  that  did  students  of  other  groups.  Together  with 
A students,  they  also  appeared  less  altruistic,  less  interpersonally  oriented, 
and  more  task  oriented  than  did  students  of  other  groups.  B^  students  also 
showed  the  highest  facilitating  anxiety  and  the  lowest  debilitating  anxiety 
scores.  B^  students  were  the  most  interpersonally  oriented  and  the  least 
task  oriented,  on  average,  of  all  the  groups.  The  resulting  hypothesis  is 
that  high  ability  students  who  are  also  highly  motivated  and  oriented  toward 
independent  task  activity  do  poorly  in  this  instructional  method,  since  it 
demands  a large  degree  of  cooperative  interpersonal  activity  in  learning. 

Middle  ability  students  who  value  the  interpersonal  interaction  perhaps  more 
than  the  task  activity  do  as  well  or  better  than  many  high  ability  students. 

A further  step  of  the  analysis  procedure  was  to  follow  these  students 
through  successive  four-week  blocks  of  instruction.  Figure  6b  and  6c  show 
example  scatterplots  for  two  later  blocks,  in  which  students  are  identified 
by  symbols  corresponding  to  their  area  groups  from  the  first  block.  The  B^ 
students  appear  gradually  to  adapt  to  the  instructional  situation,  emerging 
at  the  top  of  the  class  by  the  20th  week  (when  content  is  also  becoming  more  dif- 
ficult and  more  biomedical  in  character).  Some  B^ , B0 , and  A students  seem  to 
be  falling  out  of  the  distribution  across  this  same  sequence;  in  effect,  they 
are  becoming  outliers.  Perhaps  the  novelty  of  the  interpersonal  instructional 
situation  and  their  initial  success  in  it  is  not  sustaining.  Thus,  the  ap- 
titude complex  that  accounts  for  performance  at  an  early  stage  shifts  in 
later  stages  of  instruction;  what  at  first  appeared  to  be  an  aptitude  for 
this  kind  of  instruction  may  appear  later  to  be  an  inaptitude,  and  vice  versa. 

Although  different  measures  were  used  in  the  Dutch  situation  than  those 
of  Peterson  a..d  Porteus,  it  is  not  hard  to  see  the  students  as  comparable 
to  the  High  G,  High  P High  A^  + A^ , Low  A^  students  discussed  earlier. 
Instruction  was  student-centered  in  the  extreme,  but  it  was  socially-centered, 
which  apparently  conflicted  at  first  with  these  student's  independent,  task- 
oriented  styles  of  work,  until  they  adapted  to  it  or  overcame  it.  This 
emphasizes  again  the  importance  of  detailed  analysis  of  aptitudes  and  treat- 
ments over  time.  It  also  provides  an  indirect  argument  in  support  of  multi- 
variate aptitude  description  even  in  educational  experiments  and  evaluations 
not  specifically  concerned  with  aptitude. 
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Laboratory  Studies  of  Aptitude  Processes. 

Before  Binet's  success  in  predicting  school  learning  with  a heterogeneous 
general  test,  Galton,  McK.  Catt?ll,  and  others  had  built  measures  of  specific 
individual  differences  in  sensory,  perceptual,  and  memory  skills.  Their  work 
was  eclipsed  in  the  ensuing  development  of  mental  testing,  and  the  study  of 
individual  differences  in  learning-related  processes  has  been  a sporadic  and 
isolated  activity  in  psychology  ever  since.  Cronbach  (1957)  and  Glaser  (1967) 
have  traced  much  of  this  history.  The  Gagne  (1967)  symposium  signaled  the 
first  significant  return  of  laboratory  research  to  this  question.  With  the 
advent  of  information  processing  analvses  of  cognitive  tasks  at  about  the  same- 
time  (ileisser,  1967),  modern  cogniti.e  psychology  has  moved  increasingly  to  the 
application  of  this  view  to  the  study  of  human  intelligence  (See,  e.g..  Resnick, 
1975).  This  begs  the  more  general  issue  of  understanding  individual  differences 
in  aptitude  in  terms  of  psychological  processes.  The  need  for  this  was  noted 
earlier  in  connection  with  both  Tables  1 and  3.  The  time  finally  seems  ripe 
for  the  development  of  a coherent  laboratory  science  of  aptitude. 

But  this  field  is  too  new  and  far  flung  for  systematic  review  at  the 
present  time.  The  best  that  can  be  done  is  to  organize  a list  of  findings 
and  ideas  that  seem  to  point  in  useful  directions.  The  list  is  little  better 
than  an  annotated  bibliography;  it  includes  only  studies  appearing  since  the 
Gagne  symposium  that  suggest  relations  between  an  aptitude  construct  rel- 
evant to  this  chapter  and  some  kind  of  cognitive  process  measure.  Primary 

emphasis  is  on  analyses  of  G G,G  . Some  ATI  studies  conducted  in  the  labora- 
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tory  spirit  are  also  included.  Lett  aside  for  the  time  being  are  experimental 
studies  of  Ay,  many  process  analyses  of  laboratory  tasks  where  aptitude  construct 
were  not  considered,  and  studies  that  attempt  to  train  aptitude;  these  of  course 
must  ultimately  be  tied  into  the  network. 

An  attempt  has  been  made  to  order  citations  according  to  a "levels 
of  processing"  conception  of  cognition,  from  Initial  stimulus  processing 
through  reasoning  and  back  to  recall  and  response.  It  should  be  clear  that  this 
ordering  is  crude  and  at  t imes arbitrary . Also  to  be  noticed  is  a certain 
bidirectionality  in  the  research;  some  investigators  reason  from  aptitudes 
to  processes,  some  from  processes  to  aptitudes.  This  is  as  it  should  be; 
there  is  no  a priori  reason  to  consider  one  kind  of  measure  more  "basic"  than 
another . 


Initial  stimulus  processing.  Trabasso  and  Bower  (1968)  proposed  a model 

of  attention  in  learning,  including  a parameter  for  stimulus  sample  size,  i.e., 

the  amount  of  stimulus  information  a subject  takes  in  at  a single  glance.  They 

regard  this  as  a fairly  consistent  individual  difference  and  suspect  that  it 

relates  to  such  aptitudes  as  G,  field  independence  (or  G,),  and  A . The 
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Salomon  work  on  cue  attendance  mentioned  earlier  may  also  fit  here. 

Boersma  and  his  colleagues  (Conklin,  Muir,  and  Boersma,  1968;  Boersma,  e£ 
al,  1969)  demonstrated  that  extreme  groups  on  the  Hidden  Figures  Test  (G  ) 
differed  in  eye  movement  patterns  during  Hidden  Figures  item  solution,  and  also 
on  picture  completion  items.  High  Gf  subjects  attended  to  parts  of  pictures 
having  higher  information  value  than  did  Low  G^  subjects.  They  also  made  more 
fixation  shifts  between  simple  and  complex  figures  than  Low  G^.  subjects. 

Males  showed  longer  fixation  times  on  the  complex  stimulus  figure  than  did 
females,  even  though  the  latter  took  more  total  time  per  item. 

A series  of  studies  by  Hettema  (1967)  suggested  that  individuals 
differing  on  three  perceptual  ability  factors  differed  also  in  the  character 
of  their  information  reception  processes.  Dutch  Army  recruits  were  divided 
into  profile  groups  using  their  factor  scores  on  Flexibility  of  Closure, 

Speed  of  Closure,  Optical  Illusions,  and  Perceptual  Speed.  (The  first  two 
of  these  are  Thurstone  factors,  but  seemed  to  be  subdivisions  of  G in  this 
study).  Then,  in  a Helson-style  (1964)  weight-j udging  experiment,  it  was 
shown  that  adaptation  level  formulae  differed  for  subjects  differing  on  per- 
ceptual factors;  subjects  high  on  Closure  Flexibility  were  less  affected  by 
focal  stimulation,  those  high  on  overcoming  illusions  were  less  affected  by 
contextual  stimulation,  and  those  high  on  Closure  Speed  were  less  affected 
by  residual  stimulation.  Other  experiments  gave  the  following  results: 
subjects  high  on  Closure  Flexibility  showed  faster  reaction  times,  especially 
under  high  focal  uncertainty  conditions;  subjects  high  on  Closure  Speed  were 
more  apt  to  find  short  solutions  to  Luchins  Jar  problems,  while  subjects  low 
on  this  factor  more  often  opted  for  long  solutions. 

Day  (1969;  1973a),  using  dichotic  listening  tests  in  which  parts  of  words  are 
fed  to  different  ears,  has  found  that  subjects  differ  strikingly  and  stably  in 
whether  they  are  language-bound  (fusing  the  two  inputs  to  report  a word)  or 
stimulus-bound  (reporting  the  stimuli  without  combining  them.)  The  individ- 
ual difference  distribution  was  actually  bimodal.  The  language-bound  subjects 
appear  to  be  poor  judges  of  temporal  order,  while  stimulus-bound  subjects  are 
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good  at  this.  And,  in  memory  span  tests,  language-bound  subjects  show  marked 
serial  position  effect  while  stimulus-bound  subjects  show  no  serial  position 
effect.  In  general,  stimulus-bound  subjects  obtain  higher  memory  span  scores. 

Another  Day  study  ( 1973b) taught  a secret  language  to  subjects  and  then 
measured  their  ability  to  translate  words  into  English.  Secret  languages  use 
letter  substitution  or  transformation  rules  of  the  kind  found  in  pig  latin. 
Language-bound  subjects  were  shown  to  make  far  more  errors  in  word  translation 

I 

than  stimulus-bound  subjects  and  to  give  more  global,  whole  word  translations 
as  opposed  to  sequential,  syllable-by-syllable,  responses. 

These  and  other  data  suggest  qualitative  differences  in  memory  associated 
with  this  individual  difference.  Day  believes  that  stimulus-bound  subjects 
translate  to  visualizations  (Gv?)  while  language-bound  subjects  do  not,  and 
that  the  former  have  a faster  readout  mechanism  for  transferring  stimulus  in- 
formation from  sensory  register  to  short  term  storage. 

This  corresponds  roughly  to  correlational  studies  by  Seibert  and  Snow 
(1965;  see  also  Snow  & Seibert,  1966),  where  the  erasure  or  backward-masking 
phenomenon  in  short  term  visual  memory  seemed  to  influence  subjects  differently 
depending  on  their  perceptual  and  verbal  ability  profiles.  Perceptual  identifi- 
cation tests  and  G measures  accounted  for  individual  differences  in  erasure  at 
c 

different  time  delay  intervals.  For  subjects  high  in  both  kinds  of  ability,  the 
masking  effect  was  less  marked. 

Rothkopf  (1972)  has  studied  individual  differences  in  inspection  rates 
during  reading,  changes  in  these  rates  as  a function  of  inserted  questions, 
and  their  relation  to  learning  outcomes.  These  rates  might  also  reflect 
stimulus  sample  size,  speed  of  registration,  or  transfer  to  short  term  memory. 

In  one  study,  he  found  that  subjects  with  low  performance  c-n  early  segments 
of  text  and  who  accelerated  their  reading  rate  during  this  period,  benefitted 
from  questions  inserted  in  later  segments  of  text.  Reading  speed  character- 
istics, in  general,  may  provide  a fundamental  avenue  for  research  relating 
aptitude  and  learning  processes  (see  also  Sticht,  1971).  There  is  other 
evidence  that  inserted  questions  sometimes  help  low  ability,  particularly  low 
memory-ability  subjects,  but  can  be  detrimental  to  Highs.  The  evidence  on 
inserted  questions  in  relation  to  G is  quite  contradictory,  however  (see  Table  2). 


Sullivan,  et  al.  ( 1971)  studied  phonics  training  for  children  who  differed 
on  a pretest  of  reading  skills.  He  found  the  kind  of  ATI  that  suggests  a 
hierarchical  arrangement  of  discrimination  skills.  Training  on  single  letters 
was  best  for  low  pretest  students  while  training  on  letter  combinations  was 
best  for  students  with  more  initial  skill. 

Cromer  (1970)  distinguished  between  "deficit"  poor  readers  (those  unable 
to  identify  words)  and  "difference"  poor  readers  (those  unable  to  integrate 
separate  meanings  in  sentences),  using  vocabulary  and  reading  score  profiles. 

He  then  demonstrated  that  printing  sentences  in  phrase  segments  allowed  differ- 
ence-poor readers  to  teach  the  comprehension  levels  of  good  readers.  Pre- 
sumably segmenting  and  clustering  the  sentences  into  spacedmeaningful  chunks 
compensated  for  the  difference-poor  readers'  inability  to  do  so.  since  these 
students  apparently  tend  to  read  word  by  word.  For  deficit-poor  readers, 
printing  sentences  as  vertical  word  lists  increased  comprehension,  though 
their  comprehension  was  still  well  below  that  of  good  readers.  Presumably 
here,  the  word  lists  overcame  the  deficit-poor  readers'  tendency  to  skip 
words  they  were  unsure  of  by  focusing  attention  on  each  word  in  turn.  It 
should  be  noted  that  fragmenting  sentences  randomly  or  printing  them  as 
word  lists  disrupted  good  readers'  processing.  They  did  best  with  con- 
ventional sentences.  Here  may  be  an  example  of  segmentation,  a type  of 
mathemagenic  event  originally  hypothesized  by  Rotnkopf  (1965).  Note, 
however,  that  different  kinds  of  stimulus  segmentation  are  needed  for 
students  with  different  ability  profiles. 

Short  Term  Mamory.  Estes  (1974)  advocated  detailed  experimental  analyses 
of  the  tasks  typically  found  in  intelligence  tests,  and  reviewed  past  research 
on  several  examples  of  this.  He  listed  various  sources  of  individual  differ- 
ences on  digit  span  tasks,  such  as  familiarity  with  ordinal  numbers,  grouping 
strategies,  coding  processes,  and  capacity  for  selective  inhibition.  On  the 
digit-symbol  substitution  part  of  the  WAIS,  he  noted  prior  research  indicating 
the  importance  of  distinctive  verbal  coding  in  memory.  Vocabulary  test  per- 
formance was  hypothesized  to  depend  on  the  structure  of  long  term  memory  built 
through  verbal  experience,  but  also  on  accessibility  and  retrieval  strategies, 
and  on  the  individual's  notion  of  what  constitutes  an  acceptable  answer.  Word 
naming  tests  were  likened  to  free  recall  processes,  suggesting  the  importance 
of  organization  and  category  usage  in  verbal  recall.  Process  theories  of  such 
tasks  could  lead  to  process  theories  of  G , and  to  improved  measures  for  diag- 
nosis and  remediation. 
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Other  recent  studies  relate  to  some  of  these  suggestions.  Mason,  Katz, 
and  Wicklund  (1975)  contrasted  memory  for  order  and  memory  for  items  in  a 
modified  letter  span  task.  Order  memory  appeared  to  be  more  highly  related  to 
reading  ability  (G^?)  than  item  memory.  And,  Jensen  and  Figueroa  (1975)  per- 
formed several  studies  relating  to  the  Jensen  Level  I-Level  II  ability  distinction. 
As  predicted,  backward  digit  span  scores  were  more  highly  related  to  WISc-IQ 
(G^.Gj,  and  G)  than  were  forward  digit  span  scores.  Differences  in  these  re- 
lations were  stronger  for  black  than  for  white  subjects.  The  interpretation 
suggests  that  backward  digit  span  requires  more  mental  processing  steps  than  does 
forward  digit  span  and,  hence,  is  closer  to  Level  II  transformation  ability. 

Note  also  its  appearance  with  G^  tests  in  the  Das-Molloy  factor  study  (Table  3). 

A program  of  studies  by  E.  Hunt  (See,  e.g..  Hunt,  Frost,  & Lunneborg,  1973) 

has  sought  to  connect  Gc  to  various  parameters  representing  speed  and  sequence 
of  short-term  memory  processing.  In  a visual  matching  task.  High  G^ 
subjects  were  found  to  be  faster  at  name  matching,  but  not  at  physical 

identity  matching,  than  subjects  with  lower  G scores.  Another  experiment 

c 

used  a continuous  paired-associates  task,  suggesting  that  related  to 
parameters  defined  as  showing  probability  of  entry  of  an  item  into  short  term 
memory,  and  negative  rate  of  loss  of  items  from  an  intermediate  memory  storage. 

In  two  other  studies.  High  G^  subjects  showed  more  release  from  proactive 
inhibition  in  word  recall  and  shallower  slope  parameters  (i.e.,  faster  memory 
search)  in  a Sternberg  memory  search  task,  than  did  subjects  with  lower  G^ 
scores.  Hunt  hypothesizes  that  verbal  ability  is  associated  with  speed  of 
coding,  order-preserving,  and  search  operations  in  short-term  memory,  but  his 
ability  measures  cannot  be  sat isf actorily  interpreted  as  verbal  specifically; 
they  represent  G^  (See  Snow,  1976.). 

Chiang  and  Atkinson  (1976,  in  press)  pursued  Hunt's  findings  on  the  vis- 
ual and  memory  search  tasks.  They  failed  to  demonstrate  significant  overall 
correlations  between  SAT  scores  (G^)  and  the  slope  parameters  indicating  speed 
of  search.  However,  striking  relations  appeared  when  the  sample  of  subjects 
was  separated  by  sex.  Higher  G^  scores  were  associated  with  faster  memory 
search  for  males  (consistent  with  Hunt),  while  for  females  the  opposite  relation 
obtained;  here  however  the  scatterplot  contained  some  strange  outliers. 
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Mediation  and  transformation.  Studies  by  Keislar  and  Stem  (1970)  com- 
pared single  hypothesis  vs.  multiple  hypothesis  strategies  in  concept  attainment. 
High  G children  performed  better  using  multiple  hypotheses  but  Low  G children 
did  best  with  the  single  hypothesis  method. 

Cason  (1972)  has  succeeded  in  developing  measures  of  process  individual 
differences  in  concept  attainment  for  hypothesis  generation,  hypothesis  eval- 
uation, and  memory  for  hypotheses.  But  these  have  not  yet  been  related  to 
mental  ability  measures  or  shown  to  be  relevant  to  experimental  manipulations. 

A line  of  studies  by  Rohwer  has  investigated  various  verbal  and  pictorial 
elaborations  in  associative  learning,  and  their  relation  to  age,  race,  and  SES 
variables.  It  is  thought,  for  example,  that  age  effects  are  due  to  older  sub- 
jects using  more,  or  more  highly  developed,  mental  elaborations  (such  as  sentences) 
to  connect  members  of  the  pairs  to  be  learned.  Age  * treatment  interactions 
have  been  shown,  as  well  as  marked  individual  differences  in  elaborative  activity 
(Rohwer  and  Bean,  1973).  In  more  recent  studies  (Rohwer,  et^  al,  n . d . ),  bo  th  G^ 
and  a specific  measure  of  individual  differences  in  paired-associate  learning 
have  been  studied  in  relation  to  the  elaboration  hypotheses.  It  was  reported 
that  elaborative  propensity  varies  reliably  as  an  individual  difference  in 
adolescents,  can  be  prompted  by  treatments  providing  different  kinds  of  elabor- 
ative instructions,  and  seems  only  weakly  related  to  G^.  There  was  an  implication 
in  one  experiment  that  repetition  instructions  (designed  to  impede  elaboration) 
were  particularly  harmful  for  subjects  with  high  pretest  scores  on  paired- 
associate  learning  proficiency;  this  may  be  another  case  of  the  interference 
phenomenon  noted  in  a previous  section  of  this  chapter . 

Imagery  seems  to  aid  associative  learning  as  another  form  of  elaboration. 

How  imagery  helps  has  been  the  subject  of  research  by  Paivio  (1974),  DiVesta, 
et  al.,  (1971),  and  many  others.  Paivio  interprets  the  data  as  suggesting  that 
imagery  and  verbal  functions  represent  a dual  processing  system.  There  is  evi- 
dence that  individuals  differ  in  the  use  and  efficiency  of  the  two  modes  of 
mediation.  In  much  of  this  work,  however,  subjects  who  are  High  vs.  Low  in 
imagery  have  been  identified  using  spatial  ability  tests.  Image-generation  is 
probably  one  of  the  processes  wrapped  up  in  G^.  Kerst  and  Levin  (1973)  have 
looked  at  individual  differences  here,  showing  that  imagery  and  sentence-mediators, 
both  experimenter-given  and  learner-generated,  are  equally  facilitative  of  reten- 
tion, but  that  learner-generated  mediators  display  markedly  greater  individual 
differences.  Levin  (1973)  also  gave  stories  in  print  vs.  pictorial  form  to 
readers  who  had  been  classified  as  good  readers,  difference-poor  readers  (weak 
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in  comprehension),  or  deficit-poor  readers  (weak  in  both  vocabulary  and  compre- 
hension), following  a distinction  made  by  Cromer  citer  earlier.  Half  of  the 
print  group  were  also  given  imagery  instructions.  Reading  with  imagery  facili- 
tated the  comprehension  of  difference-poor  readers,  and  also  of  good  readers, 
but  not  of  deficit-poor  readers.  The  latter  group  did  best  with  pictures. 

Part  of  the  research  on  imagery  vs.  verbal  elaborations  stemmed  from  early 
findings  that  pictures  were  better  than  words  as  stimuli  in  paired-associate 
learning.  Rohwer  and  Harris  (1975)  have  gone  on  to  suggest  that  combinations  of 
media  (e.g.  , pictorial  augmentation  of  oral  prose)  might  be  particularly  bene- 
ficial for  members  of  some  populations  (low  SES  black  children),  but  not  neces- 
sarily for  other  populations.  Meanwhile,  Levin,  et  al.,  (1974)  developed  a 
paired  associate  learning  test  to  identify  subjects  who  learned  relatively  well 
from  pictures  vs.  words.  It  was  then  shown  that  learners  who  were  good  with 
pictures,  but  poor  with  words,  did  significantly  better  when  given  picture  pairs 
instead  of  word  pairs.  A second  experiment  used  a prose  reading  task  given  under 
regular  vs.  imagery  instructions,  to  compare  comprehension  of  the  passage  by  good 
picture-learners  and  poor  picture-learners.  While  no  differences  among  the  group 
occurred  under  regular  reading  instructions,  the  former  subjects  were  helped,  and 
the  latter  apparently  impeded,  by  instructions  to  use  imagery. 

Coffing  (1971)  replicated  one  early  experiment  by  Rohwer  (1967)  that  had 
compared  pictures  and  words,  adding  measurement  of  individual  differences  in  eye 
movements.  His  data  suggested  that  pictures  facilitate  learning  only  for  subject 
who  tend  to  scan  pictures  rather  than  words  when  given  a choice.  Those  who 
seem  to  prefer  to  look  at  the  words  showed  no  differences  in  learning  outcome 
between  conditions.  Also  of  note  is  a study  by  Hall  (1974)  suggesting  that 
eye  movement  patterns  relate  to  the  subject's  use  of  iconic  imagery.  During 
recall  of  letters,  eye  movements  tended  to  scan  the  spatial  positions  where  the 
letters  had  previously  appeared.  These  studies  tie  back  in  with  those  mentioned 
in  the  initial  stimulus  processing  section.  Individual  differences  in  eye  move- 
ments may  deserve  attention  as  one  important  source  of  clues  about  processing 
activities  in  several  stages  of  learning. 

Perhaps  the  simplest  example  of  an  experiment  designed  to  examine  ATI  in  the 
laboratory  comes  from  the  work  of  Gavurin  (1967).  This  and  a follow-up  study  by 

Frandsen  and  Holder  (1969)  suggest  that  Gv  reflects  in  part  skill  in  mentally 
transforming  and  thus  solving  verbal  problems.  Gavurin  gave  some  subjects  anagram 

problems  under  standard  conditions  and  allowed  other  subjects  to  work  the  problems 
by  moving  the  printed  letters  about  on  the  table  top  to  try  out  alternative  arrange- 
ments. A spatial  test  correlated  strongly  with  problem  solving  under  the 
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nonmanipulation  condition  and  essentially  zero  when  subjects  were  allowed  to 
manipulate  the  letters.  Prohibiting  manipulation  apparently  forces  reliance 
on  a kind  of  internal  processing  at  which  High  G^  subjects  excel. 

Frandsen  and  Holder  followed  by  demonstrating  strong  correlation  between 
G^  and  verbal  problem  solving,  using  syllogisms,  time-rate-distance  and  de- 
duction problems,  even  when  G was  partialled  out.  They  then  showed  that 

c 

training  in  use  of  diagrammatic  techniques  like  Venn  diagrams  and  time  lines 
brought  the  problem  solving  performance  of  Low  G^  subjects  up  to  that  of  High 
G subjects. 

Shephard  and  his  coworkers  (see,  e.g.,  Shepard  and  Feng  1972)  have  analyzed 

the  processes  involved  in  many  tasks  comparable  to  those  used  to  measure  G . 

Their  work  has  not  been  concerned  with  individual  differences,  but  implies 

that  G represents  differences  in  the  construction  and  transformation  of  mental 
v 

analogs,  and  speed  therein.  While  Shepard's  work  thus  seems  to  argue  aga-inst 

a sequential  feature  comparison  account  of  G^,  French  (1965)  has  shown  that 

subjects  divide  between  these  two  strategies  on  the  basis  of  self-report,  at 

least  on  some  spatial  tasks.  As  noted  earlier  then,  the  hypothesis  is  that 

some  subjects  solve  spatial  rotation  problems  by  visual  analog  transformations 

(Gv);  others  solve  them  by  a process  of  logical -sequent ial  matching  of  stimulus 

features  (Gr  or  G ) . 

t c 

Contrasts  between  G and  G , or  between  imagery  and  verbal  processing 
v c 

systems,  prompt  hypotheses  that  relate  these  distinctions  to  cerebral  hemis- 
pheric structure.  The  Das  conjectures  about  simultaneous  vs.  successive 
processing,  or  the  Pask-Scott  distinction  between  holist  and  serialist  processing, 
both  noted  earlier,  are  similarly  suggestive.  Evidence  to  date  suggests  that 
the  left  cerebral  hemisphere  concerns  verbal-analytic  processing,  while  spatial 
image  processing  may  be  more  of  a right  hemisphere  function.  A review  of  re- 
search on  cerebral  hemispheric  assymetry  by  Lohman  (1976)  shows  reliable 
individual  differences  in  lateralization  of  language  processes,  with  average 
differences  most  notable  between  sinistrals  and  dextrals,  males  and  females, 
and  children  and  adults.  It  remains  to  be  seen  how  far  neurophysiological 
measures  '’an  be  pushed  to  help  clarify  psychological  process  conceptions  of 
such  individual  differences. 
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Reasoning  and  problem-solving.  In  a rare,  early  analysis,  Simon  and 
Kotovsky  (1963;  see  also  Kotovsky  and  Simon,  1973)  produced  a computer  simu- 
lation of  performance  on  the  Thurstone  Letter  Series  Test.  Their  model 
distinguished  two  processes,  pattern  analysis  and  sequence  generation,  but 
dealt  with  individual  differences  only  indirectly,  in  a few  variants  on  the 
basic  model.  These  deserve  research  attention,  however.  Other  investigators 
have  begun  similarly  detailed  analyses  of  other  tests  usually  considered 
fundamental  to  the  definition  of  G (or  G^.)  . 

Whitely  (1976)  studied  a large  pool  of  verbal  analogy  test  items,  dividing 
them  by  a cluster  analysis  procedure  into  eight  types  of  relational  concepts. 

She  then  administered  these,  along  with  a measure  of  eduction  of  multiple  re- 
lationships for  noun  pairs  and  six  other  ability  tests,  to  college  students. 
Predicting  ability  test  scores  from  multiple  regression  equations  made  up  of 
analogy  item  types  suggested  that  different  categories  of  analogy  items  tap 
different  abilities.  The  eduction  of  relationships  measure  also  showed  this 
pattern.  But  relations  between  measures  were  generally  low  and  the  correlational 
analysis  was  incomplete. 

R.  Sternberg  (1975)  has  conducted  the  most  comprehensive  review  and  analysis 
of  a mental  test  performance  yet  reported.  His  empirical  work  dealt  with  an- 
alogies, but  the  report  includes  a valuable  review  of  differential  and  information 
processing  theory  and  research  relevant  to  a theory  of  G (or  G^).  Sternberg  also 
developed  his  own  "componential"  theory  of  analogical  reasoning,  with  a detailed 
task  analysis  to  identify  specific  components  of  this  form  of  reasoning,  their 
combination  rules,  and  forms  of  processing.  All  this  material  cannot  be  synop- 
sized  here.  While  portions  are  discussed  by  Snow  (1976)  in  relation  to  other 
theoretical  and  methodological  issues  the  reader  is  urged  to  examine  the  original 
report  in  detail . 

In  brief,  Sternberg’s  analysis  produces  separate  reaction  time  measures  for 
six  components:  encoding,  inference,  mapping,  application,  justification,  and 

preparation-response.  Various  alternative  models  can  be  used  to  combine  these. 

A series  of  three  experiments  presented  pictorial,  verbal,  and  geometric  analogies 
to  extreme  groups  on  Gf  and  perceptual  speed,  along  with  other  G^  and  analogy 
reference  tests.  Results  suggested  that:  all  subjects  use  all  components  except 

justification,  which  is  included  by  some  subjects  but  not  others;  individual 
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differences  arise  from  differing  component  times,  not  from  different  com- 
bination rules  or  models;  G sometimes  correlates  positively  with  encoding 
time,  suggesting  a strategy  difference  whereby  more  time  taken  by  Highs  in 
encoding  makes  later  component  operation  more  efficient;  correlates  negatively 
with  time  for  each  later  component,  but  only  under  specified  conditions,  as 
e.g. , when  inference  and  mapping  involve  discovery  of  relevant  attributes 
rather  than  just  their  testing;  perceptual  speed  sometimes  correlates  negatively 
with  encoding  time,  but  this  reference  ability  is  generally  not  associated  with 
components  of  the  theory. 

A few  other  studies  can  be  inserted  here  for  future  reference.  Jacobs 
and  Vandeventer  (1971a;  1971b),  Guinaugh  (1969),  and  Turner  (1975)  have  all 
conducted  training  experiments  on  Raven  Progressive  Matrices  (G^.) . These 
should  help  guide  process  analyses  of  this  task  of  the  sort  that  Sternberg 
applied  to  analogies. 

Strategies  and  structure.  Research  by  Dunham  and  Bunderson  (1969) 
connects  to  some  extent  with  the  Greeno-Mayer  work  reviewed  earlier.  It 
also  introduces  research  distinguishing  G from  memory  abilities  and  raises 
the  large  issue  of  compensatory  plans  or  strategies  for  using  one's 
abilities.  One  of  their  investigations  showed  that  a principle-structured 
treatment  depended  particularly  on  reasoning  ability  (G^.?),  while  a general 
orientation  or  relatively  unstructured  treatment  depended  on  memory  ability, 
using  simple  concept-attainment  tasks.  The  implication  is  that  what  ability 
determines  success  depends  on  the  way  one  stores  incoming  information.  If 
one  must  remember  isolated  bits  of  information,  memory  ability  is  the  necessary 
condition  for  learning.  If  one  can  sort  the  relevant  information  by  applying 
some  rational  structure  or  principle  then  only  this  information  needs  to  be 
remembered.  Memory  load  is  reduced  but  now  the  ability  to  reason  is  the 
necessary  condition.  Each  treatment  serves  a compensatory  function;  each 
allows  the  learner  to  substitute  an  ability  he  has  for  one  he  lacks. 

Dunham  (1969)  reported  a similar  finding  comparing  a condition  where 
pairs  of  stimuli  presented  together  were  always  instances  of  the  same  con- 
cept, with  a condition  where  pairs  were  instances  of  different  concepts. 

Reasoning  ability  correlated  with  performance  in  the  "same"  condition  while 

memory  ability  correlated  with  performance  in  the  "different"  condition. 

Merrill  (1974)  added  the  finding  that  a negative  relation  of  G^.  to  test 

item  response  latency,  found  in  an  examples-only  condition,  was  reduced  under 
conditions  where  objectives  and/or  rules  were  provided.  The  task  was  brief 
CAI  instruction  about  an  imaginary  science. 
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Labouvie,  et  a 1 (1973)  used  an  immediate  recall  vs.  delayed  recall  contrast, 
with  picture  stimuli.  A series  of  10  pictures  were  shown  on  each  trial,  with  i 
letter  cancellation  task  performed  in  the  delay  interval.  Subjects  wer? 
college  students.  Memory  ability  measures  correlated  with  recall  performance 
primarily  on  early  blocks  of  trials  in  the  immediate  recall  condition.  C 
measures  correlated  with  performance  mainly  on  late  trial  blocks  in  the  delay 
condition.  In  a later  series  of  experiments,  Labouvie-Vief , et^  al^  (1975)  tried 
to  replicate  this  finding.  One  study  gave  a similar  recall  task  under  three 
conditions:  standard  free  recall,  chunking  instructions,  rote  rehearsal  in- 
structions. Here  subjects  were  from  Grades  7 and  12.  G correlated  with  recall 
performance  on  all  trials  in  all  conditions  at  both  age  levels.  Memory  measures 
correlated  with  nothing.  Another  study  used  subjects  from  Grades  3 and  10, 
using  a recall  task  and  delay  vs.  no  delay  conditions  like  that  in  the  first 
study.  Raven  (G^)  and  digit  span  were  the  only  ability  measures  (whereas 
earlier  work  had  included  multiple  measures  and  factor  scores  for  the  aptitude 
constructs).  Results  here  showed  scattered  significant  correlations  of 
recall  with  G^  but  none  with  memory  span.  Finally,  a year  later  the  third 
graders  above  performed  a paired  associate  task.  This  was  given  under  conditions 
designed  to  promote  speeded  rote  memory  vs.  an  imagery  strategy  in  learning 
the  pairs.  G correlated  significantly  with  performance  in  the  condition? 
prompting  use  of  an  imagery  strategy  and  zero  in  the  speeded  rote  condition 
Digit  span  gave  moderate  nonsignificant  correlations  in  all  conditions. 

Thus,  G or  G^  seems  relevant  to  recall  performance  under  various 
conditions  at  various  ages,  perhaps  especially  when  subjects  can  choose 
or  are  prompted  to  use  some  organizing  strategy.  Memory  ability  showed  no 
consistent  relevance. 

There  have  also  been  studies  of  individual  differences  in  subjective 
strategies  in  word  recall.  Frederiksen  (19f>9)  found  that  performance  it 
recall  was  positively  related  to  G^  and  memory  span,  and  negatively  relate! 
to  the  learner's  self-reported  tendency  to  modify  strategies  during  learning. 

When  subjects  had  to  anticipate  words  in  clusters,  those  high  in  fluency 
and  low  in  semantic  flexibility  and  using  a strategy  of  active  sequential  or- 
ganization, did  best.  In  a serial  anticipation  condition  , only  use  of  a 
mnemonic  strategy  was  related  to  success. 

Cohen  (1973)  gave  paired  associates  to  college  students  who  reported  the 
organizing  strategies  they  used  for  each  item, after  studying  and  recalling 
several  lists.  They  were  then  assigned  to  conditions,  designed  to  promote 
repetitive  (rote),  formal  (using  common  letter  patterns),  or  experimental 
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(using  familiar  ideas)  organizing  strategies.  A control  group  was  told  to  use 
any  preferred  procedure.  It  was  found  that  subjects  who  naturally  used  more 
high  level  strategies  (formal  or  experimental)  showed  higher  recall  performance 
than  did  subjects  preferring  repetitive  or  unconscious  strategies.  The  latter's 
performance  improved  under  instructions  to  use  high  level  strategies.  Ability 
measures  were  not  included  however. 

Scattered  other  studies  have  investigated  individual  differences  in 
subjective  organization  in  recall,  but  have  not  been  concerned  with  aptitude 
constructs.  They  are  difficult  to  interpret  in  the  context  of  this  chapter. 

And,  Sternberg  and  Tulving  (in  press)  have  shown  that  different  measures  of 
subjective  organization  in  free  recall  have  different  psychometric  properties, 
correlate  differently  with  recall,  and  that  these  differences  account  for 
disparity  among  the  results  of  a range  of  experiments. 

Response  integration  and  retention.  There  is  little  to  go  on  regarding 
process  individual  differences  in  response  integration,  outside  of  psychomotor 
research  (For  this,  see  Fleishman  , 1975k  ) • But  Sternberg  hypothesized  a few 
additional  components  in  his  preparation-response  category  to  account  for  an 
unexpected  result  in  his  analogy  experiments.  The  component  score  for  prepar- 
ation-response was  highly  correlated  with  G^.,  suggesting  that  preparation  and 
response  need  to  be  separated.  Then# such  a correlation  might  be  found  to  arise 
from  individual  differences  in  planning  at  the  start  of  solution,  "bookkeeping" 
during  solution,  or  a decision  process  occuring  near  the  end  of  solution.  In- 
dividual differences  in  motivation  or  attention  throughout  solution  were  also 
noted  as  possible  explanations.  One  can  imagine  all  these  processes  involved 
in  more  general  learning  and  problem  solving,  but  it  seems  that  "response  in- 
tegration" still  would  deserve  more  detailed  analysis. 

Recent  studies  suggesting  process  conceptions  of  individual  differences  in 
long-term  retention  have  not  been  found.  There  are  a few  implications  to  be 
noted  in  the  Gagne*  (1967)  symposium,  and  in  Cronbach  and  Snow.  And  some  recent 
work  on  short-term  retention  suggests  that  learners  who  differ  in  their  rates 
of  initial  learning  nonetheless  forget  at  the  same  rate  (Shuell  and  Giglio, 
1973).  If  there  are  individual  differences  in  long  term  retention,  they  should 
relate  to  G^  , since  knowledge  accumulation  and  use  over  long  intervals  is 
part  of  the  definition  of  that  construct. 
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Prospectus 

Instructional  studies  of  ATI,  methodological  improvements  for  such 
studies,  and  process  analyses  of  aptitude^ constitute  a three-fold  path  toward 
the  integration  of  experimental  and  differential  psychology  in  educational 
research  and,  hopefully,  toward  improved  theory  and  practice  to  explain  and 
use  aptitude  for  instruction.  The  first  aspect  of  this  path  attacks  ATI 
phenomena  in  school  learning  directly,  the  second  develops  the  means  to 
understand  ATI  data  at  this  level,  and  the  third  seeks  to  transmit  to  such 
research  improved  process  models  and  measures  of  aptitude.  Continuing  work 
relevant  to  each  aspect  will  likely  deserve  its  own,  more  intensive  review 
in  the  coming  years.  A detailed  summary  of  this  chapter  here  thus  seems 
unnecessary  as  well  as  impossible. 

In  the  past,  individual  investigators  have  usually  concentrated  on  school 
studies,  or  on  laboratory  work,  or  on  methodology,  as  their  aptitudes  have 
moved  them.  The  three-fold  path  conception  of  this  chapter  is  meant  to  suggest 
that  all  three  aspects  need  to  be  addressed  in  the  future,  in  coordinated  re- 
search programs  on  common  ATI  phenomena. 

Educational  researchers  interested  in  this  problem  can  find  many  different 
hypotheses  on  which  to  focus  their  work.  This  chapter  has  suggested  that  the 

A. A A and  G G rC,  apLitude  complexes,  and  their  interconnections,  deserve  par- 

i c x c f v r 

ticular  attention.  Educational  researchers  not  interested  in  this  problem  can 
ignore  it.  But  individual  differences  in  aptitude  for  learning  will  not  go 
away . 
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